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INSULIN AND IGF-1 RECEPTOR AGONISTS AND ANTAGONISTS 

This application is a continuation-in-part of U.S. Application Serial No. 
5 09/962,756 filed September 24, 2001 , which is a continuation-in-part of U.S. 
Application Serial No. 09/538,038 filed March 29, 2000, which is a 
continuation-in-part of U.S. Application Serial No. 09/146,127, filed 
September 2, 1998, all of which are incorporated herein by reference in their 
entirety. 

1 0 I. FIELD OF THE INVENTION 

This invention relates to the field of hormone receptor activation or 
inhibition. More specifically, this invention relates to the identification of 
molecular structures, especially peptides, which are capable of acting at 
either the insulin or insulin-like growth factor receptors as agonists or 
15 antagonists. Also related to this invention is the field of molecular modeling 
whereby useful molecular models are derived from known structures. 

II. BACKGROUND OF THE INVENTION 

Insulin is a potent metabolic and growth promoting hormone that acts 
on cells to stimulate glucose, protein, and lipid metabolism, as well as RNA 

20 and DNA synthesis. A well-known effect of insulin is the regulation of 
glucose levels in the body. This effect occurs predominantly in liver, fat, and 
muscle tissue. In the liver, insulin stimulates glucose incorporation into 
glycogen and inhibits the production of glucose. In muscle and fat tissue, 
insulin stimulates glucose uptake, storage, and metabolism. Defects in 

25 glucose utilization are very common in the population, giving rise to 
diabetes. 

Insulin initiates signal transduction in target cells by binding to a 
specific cell-surface receptor, the insulin receptor (IR). The binding leads to 
conformational changes in the extracellular domain of IR, which are 
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transmitted across the cell membrane and result in activation of the 
receptor's tyrosine kinase activity. This, in turn, leads to 
autophosphorylation of tyrosine kinase of IR, and the binding of soluble 
effector molecules that contain SH2 domains such as phophoinositol-3- 
5 kinase, Ras GTPase-activating protein, and phospholipase Cy to IR (Lee 
and Pilch, 1994, Am. J. Physiol. 266:C319-C334). 

Insulin-like growth factor 1 (IGF-1) is a small, single-chain protein 
(MW = 7,500 Da) that is involved in many aspects of tissue growth and 
repair. It is similar in size, sequence, and structure to insulin, but has 100- 

10 1,000-fold lower affinity for IR (Mynarcik et a/., 1997, J. Biol. Chem. 
272:18650-18655). Although IGF-1 mRNA can be detected in many tissues, 
the majority of circulating IGF-1 is produced in the liver after stimulation by 
growth hormone (Butt et at., 1999, Immunol. Cell Biol. 77:256-262). 
Functionally, IGF-1 appears to act as a mitogen and as an anti-apoptotic 

15 factor for cells. 

Recent studies have analyzed the role of endogenous IGF-1 in 
various disease states. Several reports have shown that IGF-1 promotes 
the growth of normal and cancerous prostate cells both in vitro and in vivo 
(Angelloz-Nicoud and Binoux, 1995, Endocrinol. 136:5485-5492; Figueroa et 

20 a/., 1995, J. Clin. Endocrinol. Metab. 80:3476-3482; Torring et a/., 1997, J. 
Urol. 158:222-227). Elevated serum levels of IGF-1 have been shown to be 
associated with increased risks of prostate cancer, and may be an earlier 
predictor of onset than prostate-specific antigen (PSA; J.M. Chan et al., 
1998, Science 279:563-566). Serum levels of free IGF-1 are regulated by 

25 the presence of IGF binding proteins (IGFBP), which bind to IGF-1 and 
prevent its interaction with the IGF-1 R (reviewed in C.A. Conover, 1996, 
Endocr. J. 43S:S43-S48; Rajaram et al., 1997, Endocr. Rev. 18:801-831). 
PSA has been shown to be a protease that cleaves IGFBP-3, resulting in an 
increase of free IGF-1 in serum (P. Cohen et al., 1992, J. Clin. Endocrinol. 

30 Metab. 75:1046-1053; P. Cohen era/., 1994, J. Endocrinol. 142:407-415; H. 
Lilja, 1995, Scand. J. Clin. Lab. Invest. Suppl. 220:47-56). Consistent with 
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this finding, men with higher levels of circulating IGF-1 and lower levels of 
IGFBP-3 were found to be at higher risk for developing colorectal cancer (J. 
Ma et al., 1999, J. Natl. Cancer Instit. 91:620-625.). Recent studies have 
also shown a connection between IGF-1 levels and ovarian cancer. 
5 There also appears to be a relationship between high levels of IGF-1 

and/or IGF-1 R and breast cancer (L.C. Happerfield et a/., 1997, J. Pathol. 
183:412-417). A positive correlation was observed between circulating IGF- 
1 and breast cancer among pre-menopausal women (S.E. Hankinson et al., 
1998, Lancet 351:1393-1396). A poor prognosis for breast cancer patients 

10 was correlated to the expression of IGF-1 R positive and estrogen receptor 
(ER) negative cells (A.A. Butler et al., 1998, Cancer Res. 58:3021-3027). 
Recently, investigators have identified hybrid IGF-1 R/IR receptors found in 
several breast cancer cell lines (G. Pandini et al., 1999, Clin. Cancer Res. 
5:1935-1944; E.M. Bailyes et al., 1997, Biochem. J. 327(Pt 1):209-215; see 

15 below). The data has suggested that these hybrids behave as functional 
IGF-1 Rs and may play a major role in IGF-1 signaling in breast cancer. 

Clinical studies have also investigated the use of recombinant human 
IGF-1 in the treatment of several diseases, including type I diabetes (Carroll 
et al., 1997, Diabetes 46:1453-1458; Crowne et al., 1998, Metabolism 

20 47:31-38), amyotropic lateral sclerosis (Lai et al., 1997, Neurology 49: 1621 - 
1630), and diabetic motor neuropathy (Apfel and Kessler, 1996, CIBA 
Found. Symp. 196:98-108). Other potential therapeutic applications of IGF- 
1, such as osteoporosis (Canalis, 1997, Bone 21:215-216), immune 
modulation (Clark, 1997, Endocr. Rev. 18:157-179) and nephrotic syndrome 

25 (Feld and Hirshberg, 1996, Pediatr. Nephrol. 10:355-358) are also under 
investigation. Clearly, IGF-1 R activity is involved in many disease states, 
indicating that there are potential clinical applications for both IGF-1 agonists 
and antagonists. 

Both insulin and IGF-1 are expressed as precursor proteins 
30 comprising, among other regions, contiguous A, B, and C peptide regions, 
with the C peptide being an intervening peptide connecting the A and B 
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peptides. A mature insulin molecule is composed of the A and B chains 
connected by disulfide bonds, where the connecting C peptide has been 
removed during post-translational processing. IGF-1 retains its smaller C- 
peptide as well as a small D extension at the C-terminal end of the A chain, 
5 making the mature IGF-1 slightly larger than insulin (Blakesley, 1996). The 
C region of human IGF-1 appears to be required for high affinity binding to 
IGF-1 R (Pietrzkowski et a/., 1992, Cancer Res. 52(23):6447-51). 
Specifically, tyrosine 31 located within this region appears to be essential for 
high affinity binding. Furthermore, deletion of the D domain of IGF-1 

10 increased the affinity of the mutant IGF-1 for binding to the IR, while 
decreasing its affinity for the IGF-1 R (Pietrzkowski et ai, 1992). A further 
distinction between the two hormones is that, unlike insulin, IGF-1 has very 
weak self-association and does not hexamerize (De Meyts, 1994). 

IGF-1 and insulin competitively cross-react with IGF-1 R and IR (L. 

15 Schaffer, 1994, Eur. J. Biochem. 221:1127-1132). Yet, despite 45% overall 
amino acid identity, insulin and IGF-1 bind only weakly to each other's 
receptor. The affinity of each peptide for the non-cognate receptor is about 
3 orders of magnitude lower than that for the cognate receptor (Mynarcik, et 
ai, 1997, J. Biol. Chem. 272:18650-18655). The differences in binding 

20 affinities may be partly explained by the differences in amino acids and 
unique domains which contribute to unique tertiary structures of ligands 
(Blakesley era/., 1996, Cytokine Growth Factor Rev. 7(2):153-9). 

IGF-1 R and IR are related members of the tyrosine-kinase receptor 
superfamily of growth factor receptors. Another family member is insulin- 

25 related receptor (IRR), for which no natural ligand is known. Both IGF-1 R 
and IR are comprised of two a and two p subunits which form a disulfide- 
linked heterotetramer (p-a-a-p). These receptors have an extracellular 
ligand binding domain, a single transmembrane domain, and a cytoplasmic 
domain displaying the tyrosine kinase activity. The extracellular domain is 

30 composed of the entire a subunits and a portion of the N-terminus of the p 
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subunits, while the intracellular portion of the p subunits contains the 
tyrosine kinase domain. In contrast to other tyrosine kinase receptors, IGF- 
1R, IR and IRR exist on the cell surface as disulphide-linked dimers and 
require domain rearrangements rather than receptor oligomerization for cell 
5 signaling (Adams et ai, 2000, Cell. Mol. Life Sci. 57:1050-1093; Garrett et 
a/., 1998, Nature 394:395-399; Frasca et ai, 1999, Mol. Cell Biol. 19: 3278- 
3288; De Meyts et ai, 1994, Hormone Res. 42:152-169). In addition, insulin 
and IGF-1 hemireceptors (comprising one a subunit and one p subunit) can 
heterodimerize to form IR/IGF-1R hybrids (M.A. Soos et a/., 1990, Biochem. 

10 J. 270:383-390; J. Kasua et ai, 1993, Biochemistry 32:13531-13536; B.L. 
Seely etai, 1995, Endocrinology 136:1635-1641). 

In many cells, IR/IGF-1R hybrids are the most common receptor 
subtype (Bailyes et ai, 1997, Biochem. J. 327(pt.1):209-215). The 
proportion of total IGF-1 R assembled into hybrids varies between 40% and 

15 60% in human tissues (M. Federici et ai, 1997, Mol. Cell. Endocrin. 
129(2):121-6). IR/IGF-1R hybrids are also overproduced in human cancer 
cells as a result of overexpression of IR and IGF-1 R (Pandini et ai, 1999, 
Clin. Cancer Res. 5:1935-1944; A. Belfiore et ai, 1999, Biochemie, 
81(4):403-7; V. Papa etai, 1990, J. Clin. Invest. 86:1503-1510; V. Papa et 

20 ai, 1993, Cancer Res. 53:3736-3740). In particular, increased levels of 
IR/IGF-1R hybrids have been observed in breast cancer cell lines and 
breast cancer tissue specimens (Pandini et ai, 1999, Clin. Cancer Res. 
5:1935-1944). Similarly, high levels of IR/IGF-1R hybrids have been 
observed in thyroid cancer specimens and cell lines (A. Belfiore et ai, 1999, 

25 Biochemie, 81(4):403-7). Functional studies have indicated that IR/IGF-1R 
hybrids are predominantly activated by IGF-1 (M.A. Soos ef ai, 1993, 
Biochem. J. 290(pt.2):4 19-426; A.L. Frattali et ai, 1993, J. Biol. Chem. 
268:7393-7400). Accordingly, it has been postulated that IR/IGF-1R hybrids 
provide additional binding sites for IGF-1, and thereby increase cell 

30 sensitivity to this factor (Bailyes et ai, 1997, Biochem. J. 327(pt.1):209-215; 



WO 03/027246 PCT/US02/30412 



-6- 

Pandini et a/., 1999, Clin. Cancer Res. 5:1935-1944; A. Belfiore eta!., 1999, 
Biochemie, 81(4):403-7). 

IR is a glycoprotein having molecular weight of 350-400 kDa 
(depending of the level of glycosylation). It is synthesized as a single 
5 polypeptide chain and proteolytically cleaved to yield a disulfide-linked 
monomer a-p insulin receptor. Two a-p monomers are linked by disulfide 
bonds between the a-subunits to form a dimeric form of the receptor (P-a-a- 
p-type configuration). The a subunit is comprised of 723 amino acids, and it 
can be divided into two large homologous domains, L1 (amino acids 1-155) 

10 and L2 (amino acids 313-468), separated by a cysteine-rich region (amino 
acids 156-312) (Ward et al., 1995, Prot. Struct Funct. Genet. 22:141-153). 
Many determinants of insulin binding seem to reside in the a-subunit. The 
p-subunit of IR has 620 amino acid residues and three domains: 
extracellular, transmembrane, and cytosolic. The extracellular domain is 

15 linked by disulfide bridges to the a-subunit. The cytosolic domain includes 
the tyrosine kinase domain, the three-dimensional structure of which has 
been solved (Hubbard et al., 1994, Nature 372:746-754). A unique feature 
of IR is that it is dimeric in the absence of ligand. 

To aid in drug discovery efforts, a soluble form of a membrane-bound 

20 receptor was constructed by replacing the transmembrane domain and the 
intracellular domain of IR with constant domains from immunoglobulin Fc or 
X subunits (Bass et al., 1996, J. Biol. Chem. 271:19367-19375). The 
recombinant gene was expressed in human embryonic kidney 293 cells. 
The expressed protein was a fully processed heterotetramer and the ability 

25 to bind insulin was similar to that of the full-length holoreceptor. 

IGF-1R is synthesized as a 180 kDa precursor which is glycosylated, 
dimerized and proteolytically processed to yield mature receptor (T.E. 
Adams et al., 2000, Cell. Mol. Life Sci., 57:1050-1093, 2000). The mature 
receptor/complex consists of two extracellular a-subunits and two 

30 transmembrane p-subunits having tyrosine kinase activity. IGF-1R is 
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expressed in almost all normal adult tissue except for liver, which is itself the 
major site of IGF-1 production (Butt ef a/., 1999, Immunol. Cell Biol. 77:256- 
262). A variety of signaling pathways are activated following binding of IGF- 
1 to the IGF-1 R, including Src and ras, as well as downstream pathways, 
5 such as the MAP kinase cascade and the PI3K/AKT axis (Chow et ai, 1998, 
J. Biol. Chem. 273:4672-4680). 

The sequence of IR is highly homologous to the sequence of IGF-1 R, 
indicating that the three-dimensional structures of both receptors may be 
similar. The oc-subunits, which contain the ligand binding region of IR and 

10 IGF-1 R, exhibit between 47-67% overall amino acid identity. Three general 
domains, termed L1, cysteine-rich, and L2, have been reported for both 
receptors from sequence analysis of the a subunits. The cysteine residues 
in the cysteine-rich region are highly conserved between the two receptors; 
however, the cysteine-rich regions share only 48% overall amino acid 

15 identity. Notably, the crystal structure of the first three domains of IGF-1 R 
has been determined (Garrett ef a/., 1998, Nature 394:395-399). The L 
domains consist of a single-stranded right-handed p-helix (a helical 
arrangement of p-strands), while the cysteine-rich region is composed of 
eight disulfide-bonded modules. 

20 While similar in structure, IGF-1 R and IR serve different physiological 

functions. IR is primarily involved in metabolic functions whereas IGF-1 R 
mediates growth and differentiation. Consistent with this, ablation of IGF-1 
(i.e., in IGF-1 knock-out mice) results in embryonic growth deficiency, 
impaired postnatal growth, and infertility. In addition, IGF-1 R knock-out 

25 mice were only 45% of normal size and died of respiratory failure at birth 
(Liu et ai, 1993, Cell 75:59-72). However, both insulin and IGF-1 can 
induce both mitogenic and metabolic effects. Whether each ligand elicits 
both activities via its own receptor, or whether insulin exerts its mitogenic 
effects through its weak affinity binding to IGF-1 R, and IGF-1 its metabolic 
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effects through IR, remains controversial (De Meyts, 1994, Horm. Res. 
42:152-169). 

Also, despite the similarities observed between these two receptors, 
the role of the domains in specific ligand binding are distinct. Through 
5 chimeric receptor studies, (domain swapping of the IR and IGF-1R a- 
subunits), researchers have reported that the sites of interaction of the 
ligands with their specific receptors differ (T. Kjeldsen et a/., 1991, Proc. 
Natl. Acad. Sci. USA 88:4404-4408; A.S. Andersen et a/., 1992, J. Biol. 
Chem. 267:13681-13686). For example, the cysteine-rich domain of the 

10 IGF-1R was determined to be essential for high-affinity IGF binding, but not 
insulin binding. When amino acids 191-290 of IGF-1R region was 
introduced into the corresponding region of the IR (amino acids 198-300), 
the modified IR bound both IGF-1 and insulin with high affinity. Conversely, 
when the corresponding region of the IR was introduced into the IGF-1 R, the 

1 5 modified IGF-1 R bound to IR but not IGF-1 . 

A further distinction between the binding regions of the IR and IGF- 
1R is their differing dependence on the N-terminal and C-terminal regions. 
Both the N-terminal and C-terminal regions (located within the putative L1 
and L2 domains) of the IR are important for high-affinity insulin binding but 

20 appear to have little effect on IGF-1 binding for either IR or IGF-1 R. 
Replacing residues in the N-terminus of IGF-1 R (amino acids 1-62) with the 
corresponding residues of IR (amino acids 1-68) confers insulin-binding 
ability on IGF-1 R. Within this region, residues Phe-39, Arg-41 and Pro-42 
are reported as major contributors to the interaction with insulin (Williams et 

25 a/., 1995). When these residues are introduced into the equivalent site of 
IGF-1 R, the affinity for insulin is markedly increased, whereas, substitution 
of these residues by alanine in IR results in markedly decreased insulin 
affinity. Similarly, the region between amino acids 704-717 of the C- 
terminus of IR has been shown to play a major role in insulin specificity. 

30 Substitution of these residues with alanine also disrupts insulin binding 
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(Mynarcik et a/., 1996, J. Biol. Chem. 271(5):2439-42; C. Kristensen et a/., 
1999, J. Biol. Chem. 274(52):37351 -37356). 

Alanine scans of IR and IGF-1R suggest that insulin and IGF-1 may 
use some common contacts to bind to IGF-1 R but that those contacts differ 
5 from those that insulin utilizes to bind to IR (Mynarcik et a/., 1997). Hence, 
the data in the literature has led one commentator to state that even though 
"the binding interfaces for insulin and IGF-1 on their respective receptors 
may be homologous within this interface the side chains which make actual 
contact and determine specificity may be quite different between the two 

1 0 ligand-receptor systems" (De Meyts, 1 994). 

Based on data for binding of insulin and insulin analogs to various 
insulin receptor constructs, a binding model has been proposed. This model 
shows insulin receptor with two insulin binding sites that are positioned on 
two different surfaces of the receptor molecule, such that each alpha-subunit 

15 is involved in insulin binding. In this way, activation of the insulin receptor is 
believed to involve cross-connection of the alpha-subunits by insulin. A 
similar mechanism may operate for IGF-1 R, but one of the receptor binding 
interactions appears to be different (Schaffer, 1994, Eur. J. Biochem. 
221:1127-1132). 

20 The identification of molecular structures having a high degree of 

specificity for one or the other receptor is important to developing efficacious 
and safe therapeutics. For example, a molecule developed as an insulin 
agonist should have little or no IGF-1 activity in order to avoid the mitogenic 
activity of IGF-1 and a potential for facilitating neoplastic growth. It is 

25 therefore important to determine whether insulin and IGF-1 share common 
three-dimensional structures but which have sufficient differences to confer 
selectivity for their respective receptors. Similarly, it would be desirable to 
identify other molecular structures that mimic the active binding regions of 
insulin and/or IGF-1 and which impart selective agonist or antagonist 

30 activity. 
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Although certain proteins are important drugs, their use as 
therapeutics presents several difficult problems, including the high cost of 
production and formulation, administration usually via injection and limited 
stability in the bloodstream. Therefore, replacing proteins, including insulin 
5 or IGF-1 , with small molecular weight drugs has received much attention. 
However, to date, none of these efforts has resulted in finding an effective 
drug replacement. 

Peptides mimicking functions of protein hormones have been 
previously reported. Yanofsky et al. (1996, Proc. Natl. Acad. Sci USA 

10 93:7381-7386) reported the isolation of a monomer antagonistic to IL-1 with 
nanomolar affinity for the IL-1 receptor. This effort required construction and 
use of many phage displayed peptide libraries and sophisticated phage- 
panning procedures. 

Wrighton et al. (1996, Science 273:458-464) and Livnah et al. (1996, 

1 5 Science 273:464-471 ) reported dimer peptides that bind to the erythropoietin 
(EPO) receptor with full agonistic activity in vivo. These peptides are 
cyclical and have intra-peptide disulfide bonds; like the IL-1 receptor 
antagonist, they show no significant sequence identity to the natural ligand. 
Importantly, X-ray crystallography revealed that it was the spontaneous 

20 formation of non-covalent peptide homodimer peptides that enabled the 
dimerization two EPO receptors. 

WO 96/04557 reported the identification of peptides and antibodies 
that bound to active sites of biological targets, which were subsequently 
used in competition assays to identify small molecules that acted as agonist 

25 or antagonists at the biological targets. Renchler et al. (1994, Proc. Natl. 
Acad. Sci. USA 91:3623-3627) reported synthetic peptide ligands of the 
antigen binding receptor that induced programmed cell death in human B- 
cell lymphoma. 

Most recently, Cwirla et al. (1997, Science 276:1696-1698) reported 
30 the identification of two families of peptides that bound to the human 
thrombopoietin (TPO) receptor and were competed by the binding of the 
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natural TPO ligand. The peptide with the highest affinity, when dimerized by 
chemical means proved to be as potent an in vivo agonist as TPO, the 
natural ligand. 

III. SUMMARY OF THE INVENTION 

5 This invention relates to the identification of amino acid sequences 

that specifically recognize sites involved in IR or IGF-1R activation. Specific 
amino acid sequences are identified and their agonist or antagonist activity 
at IR and/or IGF-1R has been determined. Such sequences may be 
developed as potential therapeutics or as lead compounds to develop other 

10 more efficacious ones. In addition, these sequences may be used in high- 
throughput screens to identify and provide information on small molecules 
that bind at these sites and mimic or antagonize the functions of insulin or 
IGF-1. Furthermore, the peptide sequences provided by this invention can 
be used to design secondary peptide libraries, which can be used to identify 

15 sequence variants that increase or otherwise modulate the binding and/or 
activity of the original peptide at IR or IGF-1 R. The peptide sequences of 
the invention can also be combined to make dimer or other multimeric 
peptides, which can be used for screening, diagnostic, and thereapeutic 
applications as described herein. 

20 In one aspect of this invention, large numbers of peptides have been 

screened for their IR and IGF-1 R binding and activity characteristics. 
Analysis of their amino acid sequences has identified certain consensus 
sequences which may be used themselves or as core sequences in larger 
amino acid sequences conferring upon them agonist or antagonist activity. 

25 Several generic amino acid sequences are disclosed which bind IR and/or 
IGF-1 R with varying degrees of agonist or antagonist activity depending on 
the specific sequence of the various peptides identified within each motif 
group. Also provided are amino or carboxyl terminal extensions capable of 
modifying the affinity and/or pharmacological activity of the consensus 

30 sequences when part of a larger amino acid sequence. Further provided 
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are peptides containing more than one consensus sequence (e.g., dimer 
peptides). 

The amino acid sequences of this invention which bind IR and/or IGF- 
1R include: 

5 a. X<[ X 2 X 3 X4 X 5 wherein X i? X 2f X4 and X 5 are aromatic amino 

acids, and X 3 is any polar amino acid (Formula 1; Group 1; A6 motif); 

b. X6X7X8X9X10X11 X12X13 wherein X 6 and X 7 are aromatic 
amino acids, Xs, X 9 , Xn and X 12 are any amino acid, and X 10 and X 13 are 
hydrophobic amino acids (Formula 2; Group 3; B6 motif); 
10 c. X 14 X 15 X 16 X17 Xie X 19 X 2 o X21 wherein Xu, and X i7 are 

hydrophobic amino acids, X 15 , X 16 , Xi 8 and X 19 are any amino acid, and X 2 o 
and X 2 i are aromatic amino acids (Formula 3; reverse B6; revB6). 

d. X 22 X 23 X 2 4 X 2 5 X 2 6 X 2 7 X 2 8 X 29 X30 X31 X3 2 X33 X34 X35 X36 X37 X 3 8 

X39 X40 X41 wherein X 22 , X 25 , X 28 , X 29 , X 30 , X 33 , X34, X 35 , X 3 e, X 37 , X 38 , X40, and 
15 X^ are any amino acid, X 35 and X 37 may be any amino acid for binding to 
IR, whereas X35 is preferably a hydrophobic amino acid and X 37 is preferably 
glycine for binding to IGF-1R and possess agonist or antagonist activity. X 23 
and X 2 6 are hydrophobic amino acids. This sequence further comprises at 
least two cysteine residues, preferably at X 25 and X40 X 31 and X 32 are small 
20 amino acids (Formula 4; Group 7; E8 motif). 

e. X4 2 X4 3 X44 X45 X46 X47 X48 X4 9 X50 X51 X52 X§ 3 X54 X55 Xs$ X57 X58 
X59 X 6 o X 6 i wherein X4 2 , X4 3 , X44, X45, X 53 , X 55 , X 56 , X 58l X 6 o and X 6 i may be 
any amino acid, X4 3> X46, X49, X 50 , X54 are hydrophobic amino acids, X47 and 
X 59 are preferably cysteines, X48 is a polar amino acid, and X51, X 52 and X 57 

25 are small amino acids (Formula 5; mini F8 motif). 

f. X62 X63 X64 X65 X66 X67 X68 X69 X70 X71 X7 2 X73 X74 X75 X76 X77 X78 

X7 9 xso Xsi wherein X62, X65, X68, Xe 9 , X71, X73, X76, X77, X78, Xeo, and Xei may 
be any amino acid; X 63 , X 7 o, X74 are hydrophobic amino acids; X64 is a polar 
amino acid, X 6 7 and X75 are aromatic amino acids and X 72 and X 79 are 
30 preferably cysteines capable of forming a loop (Formula 6; Group 2; D8 
motif). 
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g. H Xs2 Xe3 X84 X86 Xs7 Xa8 Xs9 X90 X91 X92 wherein Xs2 is 
proline or alanine, Xs3 is a small amino acid, Xs4 is selected from leucine, 
serine or threonine, Xq 5 is a polar amino acid, Xse, Xsa, Xe9 and X90 are any 
amino acid, and Xq 7 , X91 and X 92 are an aliphatic amino acid (Formula 7). 
5 h. X104 X105 X106 X107 X108 X109 Xno Xm X112 X113 X114 wherein at 

least one of the amino acids of X106 through X 111( and preferably two, are 
tryptophan separated by three amino acids, and wherein at least one of X104, 
X105 and X106 and at least one of Xn 2 , Xn 3 and X 114 are cysteine (Formula 
8); and 

10 i. an amino acid sequence comprising the sequence JBA5: 

DYKDLCQSWGVRIGWLAGLCPKK (SEQ ID NO:1541) or JBA5 minus 
FLAG® tag and terminal lysines: LCQSWGVRIGWLAGLCP (SEQ ID 
NO: 1542) (Formula 9). 

j. W X 123 G Y X124 W X 125 Xi26 (SEQ ID NO: 1543) wherein X i23 is 

15 selected from proline, glycine, serine, arginine, alanine or leucine, but more 
preferably proline; X 12 4 is any amino acid, but preferably a charged or 
aromatic amino acid; X125 is a hydrophobic amino acid preferably leucine or 
phenylalanine, and most preferably leucine. X126 is any amino acid, but 
preferably a small amino acid (Formula 10; Group 6 motif). 

20 In one embodiment, peptides comprising a preferred amino acid 

sequence FYX 3 WF (SEQ ID NO:1544) (Formula 1; Group 1; A6 motif) have 
been identified which competitively bind to sites on IR. Surprisingly, 
peptides comprising an amino acid sequence FYX 3 WF (SEQ ID NO: 1544) 
can possess agonist or antagonist activity at IR or IGF-1 R. 

25 This invention also identifies at least two distinct binding sites on IR 

and IGF-1 R (Site 1 and Site 2) based on the differing ability of certain of the 
peptides to compete with one another and ligand for binding to IR or IGF- 
1R. Accordingly, this invention provides amino acid sequences that bind 
specifically to one or both sites of IR or IGF-1 R. Furthermore, specific 

30 amino acid sequences are provided which have agonist or antagonist 
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characteristics based on their ability to bind to the specific sites of I R or IGF- 
1R. 

In another embodiment of this invention, amino acid sequences which 
bind to one or more sites of IR or IGF-1R (e.g., Site 1 or Site 2) are 
5 covalently linked together to form multivalent ligands. These multivalent 
ligands are capable of forming complexes with a plurality of IR or IGF-1R. 
Either the same or different amino acid sequences are covalently bound 
together to form homo- or heterocomplexes. 

In various aspects of the invention, monomer subunits are covalently 

10 linked at their N-termini or C-termini to form N-N, C-C, N-C, or C-N linked 
dimer peptides. In one example, dimer peptides are used to form receptor 
complexes bound through the same corresponding sites, e.g., Site 1-Site 1 
or Site 2-Site 2 dimers. Alternatively, heterodimer peptides are used to bind 
to different sites on one receptor or to cause receptor complexing through 

15 different sites, e.g., Site 1-Site 2 or Site 2-Site 1 dimers. In one novel aspect 
of the invention, Site 2-Site 1 dimers find use as insulin agonists, while 
certain Site 1-Site 2 dimers find use as insulin antagonists. 

In various embodiments, insulin agonists comprise Site 1-Site 1 dimer 
peptide sequences S325, S332, S333, S335, S337, S353, S374-S376, 

20 S378, S379, S381, S414, S415, and S418; whereas other insulin agonists 
comprise Site 2-Site 1 dimer peptide sequences S455, S457, S458, S467, 
S468, S471, S499, S510, S518, S519, and S520, as described herein 
below. In one preferred embodiment, an insulin agonist comprises the 
sequence of the S519 dimer peptide, which shows insulin-like activity in both 

25 in vitro and in vivo assays. 

The present invention also provides assays for identifying compounds 
that mimic the binding characteristics of insulin or IGF-1. Such compounds 
may act as antagonists or agonists of insulin or IGF-1 function in cell based 
assays. 
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This invention further provides kits for identifying compounds that 
bind to IR and/or IGF-1 R. Also provided are therapeutic compounds that 
bind the insulin receptor or the IGF-1 receptor. 

Other embodiments of this invention are the nucleic acid sequences 
5 encoding the amino acid sequences of the invention. Also within the scope 
of this invention are vectors containing the nucleic acids and host cells 
which express the nucleic acids encoding the amino acid sequences which 
bind at IR and/or IGF-1 R and possess agonist or antagonist activity. 

This invention also provides amino acid sequences that bind to active 
10 sites of IR and/or IGF-1 R and to identify structural criteria for conferring 
agonist or antagonist activity at IR or IGF-1 R. 

This invention further provides specific amino acid sequences that 
possess agonist, partial agonist, or antagonist activity at either IR or IGF-1 R. 
Such amino acid sequences are potentially useful as therapeutics 
15 themselves or may be used to identify other molecules, especially small 
organic molecules, which possess agonist or antagonist activity at IR or IGF- 
1R. 

In addition, the present invention provides structural information 
derived from the amino acid sequences of this invention, which may be used 
20 to construct other molecules possessing the desired activity at the relevant 
IR binding site. 

IV. BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1A-10; 2A-2E; 3A-3E; 4A-4I; 43A-43B, 44A-44B: Amino acid 
sequences identified by panning peptide libraries against IGF-1 R and/or IR. 

25 The amino acids are represented by their one-letter abbreviation. The ratios 
over background are determined by dividing the signal at 405 nm (E-Tag, 
IGF-1 R, or IR) by the signal at 405 nm for non-fat milk. The IGF-1 R/IR Ratio 
Comparison is determined by dividing the ratio of IGF-1 R by the ratio of IR. 
The IR/IGF-1R Ratio Comparison is determined by dividing the ratio of IR by 

30 the ratio of IGF-1 R. HIT indicates binder; CAND indicates binder candidate; 
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LDH indicates binding to lactate dehydrogenase (negative control); Sp/lrr 
indicates the ratio of specific binding over non-specific binding. 

The design of each library is shown in the first line in bold. In the 
design, symbol 'X 1 indicates a random position, an underlined amino acid 
5 indicates a doped position at the nucleotide level, and other positions are 
held constant. Additional abbreviations in the B6H library are: 'O' indicates 
an NGY codon where Y is C or T; 'J 1 indicates an RHR codon where R is A 
or G, and H is A, C, or T; and 'IT indicates an WY codon where V is A, C, or 
G, and Y is C or T. The 'h 1 in the 20E2 libraries indicates an NTN codon. 
10 Symbols in the listed sequences include: Q indicates a position 

corresponding to a TAG stop codon; # indicates a position corresponding to 
a TAA stop codon; * indicates a position corresponding to a TGA stop 
codon; and ? indicates an unknown amino acid. It is believed that a W 
replaces the TGA stop codon when expressed. The Q residues represent 
15 translation read-through at TAG stop codons. Except for the 20C and A6L 
libraries, all libraries are designed with the short FLAG® epitope DYKD 
(SEQ ID NO:1545; Hopp et a/., 1988, Bio/Technology 6:1205-1210) at the 
N-terminus of the listed sequence and AAAGAP (SEQ ID NO:1546) at the 
C-terminus. The 20C and A6L libraries have the full length FLAG® epitope 
20 DYKDDDDK (SEQ ID NO:1547). 

Figure 1A: Formula 1 motif peptide sequences obtained from a 
random 40mer library panned against IR (SEQ ID NOS:1-3). 

Figure 1B: Formula 1 motif peptide sequence obtained from a 
random 40mer library panned against IGF-1R (SEQ ID NOS:4-6). 
25 Figure 1C: Formula 1 motif peptide sequences obtained from a 

random 20mer library panned against IR (SEQ ID NOS:7-29). 

Figure 1D: Formula 1 motif peptide sequences obtained from a 
random 20mer library panned against IGF-1R (SEQ ID NOS:30-33). 

Figure 1E: Formula 1 motif peptide sequences obtained from a 
30 21mer library constructed to contain X^oNFYDWFVX^i (SEQ ID NO:34; 
also referred to as "A6S") panned against IR (SEQ ID NOS:35-98). 
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Figure 1 F: Formula 1 motif peptide sequences obtained from a 
21mer library constructed to contain X^oNFYDWFVXis-21 (SEQ ID NO:34; 
also referred to as "A6S") panned against IGF-1R (SEQ ID NOS:99-166). 

Figure 1G: Formula 1 motif peptide sequences obtained from a 
5 library constructed to contain variations outside the consensus core of the 
A6 peptide as indicated (referred to as "A6L" (SEQ ID NO:167)) panned 
against IR (SEQ ID NOS:168-216). 

Figure 1 H: Formula 1 motif peptide sequences obtained from a 
library constructed to contain variations outside the consensus core of the 
10 A6 peptide as indicated (referred to as "A6L" (SEQ ID NO:167)) panned 
against IGF-1R (SEQ ID NOS:21 7-244). 

Figure 11: Formula 1 motif peptide sequences obtained from a 
library constructed to contain variations in the consensus core of the E4D 
peptide (SEQ ID NO:245) (as indicated) panned against IR (SEQ ID 
15 NOS:246-305). 

Figure 1J: Formula 1 motif peptide sequences obtained from a 
library constructed to contain variations in the consensus core of the E4D 
peptide (SEQ ID NO:245) (as indicated) panned against IGF-1R (SEQ ID 
NOS:306-342). 

20 Figure 1K: Formula 1 motif peptide sequences obtained from a 

library constructed using the sequence X1-6FHENFYDWFVRQVSX21-26 
(SEQ ID NO:343; H2C-A) panned against IR (SEQ ID NOS:344-430). 

Figure 1 L: Formula 1 motif peptide sequences obtained from a 
library constructed using the sequence Xi.6FHENFYDWFVRQVSX21.26 
25 (SEQ ID NO:343; H2C-A) panned against IGF-1 R (SEQ ID NOS:431-467). 

Figure 1M: Formula 1 motif peptide sequences obtained from a 
library constructed using the sequence X1.eFHXXFYXWFX1s.21 (SEQ ID 
NO:468; H2C-B) and panned against IR (SEQ ID NOS:469-575). 

Figure 1N: Formula 1 motif peptide sequences obtained from a 
30 library constructed using the sequence XLeFHXXFYXWFX^i (SEQ ID 
NO:468; H2C-B) and panned against IGF-1 R (SEQ ID NOS:576-657). 
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Figure 10: Formula 1 motif peptide sequences obtained from other 
libraries panned against IR (SEQ ID NOS:658-712). 

Figure 2A: Formula 4 motif peptide sequences identified from a 
random 20mer library panned against IR (SEQ ID NO:713). 
5 Figure 2B: Formula 4 motif peptide sequences identified from a 

library constructed to contain variations in the F8 peptide (SEQ ID NO:713) 
as indicated (15% dope; referred to as "F815") panned against IR (SEQ ID 
NOS:714-796). 

Figure 2C: Formula 4 motif peptide sequences identified from a 
10 library constructed to contain variations in the F8 peptide (SEQ ID NO:713) 
as indicated (15% dope; referred to as "F815") panned against IGF-1R 
(SEQ ID NOS:797-811). 

Figure 2D: Formula 4 motif peptide sequences identified from a 
library constructed to contain variations in the F8 peptide (SEQ ID 
15 NO:713)as indicated (20% dope; referred to as "F820") panned against IR 
(SEQ ID NOS:812-861). 

Figure 2E: Formula 4 motif peptide sequences identified from other 
libraries panned against IR (SEQ ID NOS:862-925). 

Figure 3A: Formula 6 motif peptide sequences identified from a 
20 random 20mer library and panned against IR (SEQ ID NOS:926-928). 

Figure 3B: Formula 6 motif peptide sequences identified from a 
library constructed to contain variations in the D8 peptide (SEQ ID NO:929) 
as indicated (15% dope; referred to as "D815") panned against IR (SEQ ID 
NOS:930-967). 

25 Figure 3C: Formula 6 motif peptide sequences identified from a 

library constructed to contain variations in the D8 peptide (SEQ ID NO:929) 
as indicated (20% dope; referred to as "D820") panned against IR (SEQ ID 
NOS:968-1010). 

Figure 3D: Formula 6 motif peptide sequences identified from a 
30 library constructed to contain variations in the D8 peptide (SEQ ID NO:929) 
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as indicated (20% dope; referred to as "D820") panned against IGF-1R 
(SEQ ID NOS:101 1-1059). 

Figure 3E: Formula 6 motif peptide sequences identified from other 
libraries panned against IR (SEQ ID NOS: 1060-1 061). 
5 Figure 4A: Formula 10 motif peptide sequences identified from 

random 20mer libraries panned against IGF-1R (SEQ ID NOS:1 062-1 077). 

Figure 4B: Formula 10 motif peptide sequences identified from 
random 20mer libraries panned against IR (SEQ ID NOS: 1078-1 082). 

Figure 4C: Miscellaneous peptide sequences identified from a 
10 random 20mer library panned against IR (SEQ ID NOS: 1083-1 086). 

Figure 4D: Miscellaneous peptide sequences identified from a 
random 40mer library panned against IR (SEQ ID NOS: 1087-1 088). 

Figure 4E: Miscellaneous peptide sequences identified from a 
random 20mer library panned against IGF-1R (SEQ ID NOS: 1089-1 092). 
15 Figure 4F: Miscellaneous peptide sequences identified from an X1-4 

C X6-20 library and panned against IGF-1R (SEQ ID NOS:1093-1113). 

Figure 4G: Miscellaneous peptide sequences identified from a 
library constructed to contain variations of the F8 peptide(SEQ ID NO:1114) 
as indicated (F815) panned against IGF-1R (SEQ ID NOS:1 115-1 118). 
20 Figure 4H: Miscellaneous peptide sequences identified from a 

library constructed to contain variations in the F8A11 peptide(SEQ ID 
NO:1119) as indicated (referred to as "NNKH") panned against IR (SEQ ID 
NOS:1 120-1 142). 

Figure 41: Miscellaneous peptide sequences identified from a 
25 library constructed to contain variations in the F8A11 peptide(SEQ ID 
NO:1119) as indicated (referred to as "NNKH") panned against IGF-1R 
(SEQ ID NOS:1 143-1 154). 

Figure 5A: Summary of specific representative amino acid 
sequences from Formulas 1,4, 6, and 10 (SEQ ID NOS:1 155-1 180). 
30 Figure 5B: Summary of specific representative amino acid 

sequences from Formulas 1, 4, 6, and 10 (SEQ ID NOS: 1 181-1220). 
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Figure 6: Illustration of 2 binding site domains on IR based on 
competition data. 

Figure 7: Schematic illustration of potential binding schemes to 
the multiple binding sites on IR. 
5 Figure 8: Biopanning results and sequence alignments of Group 

1 of IR-binding peptides (SEQ ID NOS:1221-1243). The number of 
sequences found is indicated on the right side of the figure together with 
data on the phage binding to either IR or IGF-1R receptor. Absorbance 
signals are indicated by: ++++, >30X over background; +++, 15-30X; ++, 5- 
10 15X;+, 2-5X;andO, <2X. 

Figures 9A-9B: Biopanning results and sequence alignments of 
Groups 2, 6, and 7 of IR-binding peptides (SEQ ID NOS: 1244-1 261). The 
number of sequences found is indicated on the right side of the figure 
together with data on the phage binding to either IR or IGF-1R receptor. 
15 Absorbance signals are indicated by: ++++, >30X over background; +++, 
15-30X; ++, 5-1 5X; +, 2-5X; and 0, <2X. 

Figures 10A-10C: Insulin competition data determined for various 
monomer and dimer peptides. Figure 10A shows the competition curve. 
Figure 10B shows the symbol key for the peptides. Figure 10C shows the 
20 description of the peptides. 

Figures 11 A-11D: Insulin competition data determined for various 
monomer and dimer peptides. Figure 11A shows the competition curve. 
Figure 11B shows the symbol key for the peptides. Figure 11C shows the 
description of the peptides. Figure 11D shows IR binding affinity for the 
25 peptides. 

Figures 12A-1 2D: Results of free fat cell assays for truncated 
synthetic RP9 monomer peptides, S390 and S394. Figure 12A shows the 
results for peptide S390. Figure 12B shows the results for peptide S394. 
Figure 12C shows the amino acid sequence of peptides S390 and S394 
30 (SEQ ID NOS.1794 and 1788, respectively in order of appearance). Figure 
12D shows the results for full-length RP9 peptide. 
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Figures 13A-13C: Results of free fat cell assays for truncated 
synthetic RP9 dimer peptides, S415 and S417. Figure 13A shows the 
results for peptide S415. Figure 13B shows the results for peptide S417. 
Figure 13C shows the amino acid sequence of peptides S415 and S417 
5 (SEQ ID NOS:1 795-1 796). 

Figures 14A-14C: Results of free fat cell assays for RP9 
homodimer peptides, 521 and 535. Figure 14A shows the results for 
peptide 521. Figure 14B shows the results for peptide 535. Figure 14C 
shows the amino acid sequence of peptides 521 and 535. 
10 Figures 15A-15C: Results of free fat cell assays for RP9-D8 

heterodimer peptides, 537 and 538. Figure 15A shows the results for 
peptide 537. Figure 15B shows the results for peptide 538. Figure 15C 
shows the amino acid sequence of peptides 537 and 538. 

Figures 16A-16C: Results of free fat cell assays for RP9-D8 
15 heterodimer peptides 537 and 538. Figure 16A shows the results for 
peptide 537. Figure 16B shows the results for peptide 538. Figure 16C 
shows the amino acid sequence of peptides 537 and 538. 

Figures 17A-17B: Results of free fat cell assays for D8-RP9 
heterodimer peptide, 539. Figure 17A shows the results for peptide 539. 
20 Figure 17B shows the amino acid sequence of peptide 539. 

Figures 18A-18D: Results of free fat cell assays for Site 1/Site 2 
dimer peptides with constituent monomer peptides with Site 1-Site 2 C-N 
(Figure 18A), Site 1-Site 2, N-N (Figure 18B), Site 1-Site 2, C-C (Figure 
18C), and Site 2-Site 1, C-N (Figure 18D) orientations and linkages, 
25 respectively. 

Figures 19A-19B: Results of human insulin receptor kinase assays 
for various monomer and dimer peptides. Figure 19A shows the substrate 
phosphorylation curve. Figure 19B shows the EC 50 values. 

Figures 20A-20B: Results of human insulin receptor kinase assays 
30 for Site 1-Site 2 and Site 2-Site 1 dimer peptides. Figure 20A shows the 
substrate phosphorylation curve. Figure 20B shows the EC 50 values. 
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Figures 21A-21B: Results of human insulin receptor kinase assays 
for Site 1-Site 2 and Site 2-Site 1 peptides. Figure 21 A shows the substrate 
phosphorylation curve. Figure 21 B shows the EC50 values. 

Figures 22A-22B: Results of time-resolved fluorescence resonance 
5 transfer assays for assessing the ability of various monomer and dimer 
peptides to compete with biotinylated RP9 monomer peptide for binding to 
soluble human insulin receptor-immunoglobulin heavy chain chimera. 
Figure 22A shows the binding curve. Figure 22B shows the symbol key and 
description of the peptide sequences (SEQ ID NOS:2117, 1916-1917, 1558, 

10 1994, 1960-1961, 2008, 1794, 2015-2016, 1560, and 2001-2002, 
respectively in order of appearance). 

Figures 23A-23C: Results of time-resolved fluorescence resonance 
transfer assays indicating the ability of various monomer and dimer peptide 
to compete with biotinylated S175 monomer peptide or biotinylated RP9 

15 monomer peptide for binding to soluble human insulin receptor- 
immunoglobulin heavy chain chimera. Figures 23A-23B show the binding 
curves. Figure 23C shows the symbol key and description of the peptide 
sequences (SEQ ID NOS:2117, 1916-1917, 1558, 1994, 1960-1961, 2008, 
1794, 2015-2016, 1560, and 2001-2002, respectively in order of 

20 appearance). 

Figures 24A-24B: Results of fluorescence polarization assays 
indicating the ability of various monomer and dimer peptide to compete with 
fluorescein labeled RP9 monomer peptide for binding to soluble human 
insulin receptor ectodomain. Figure 24A shows the binding curve. Figure 

25 24B shows the symbol key and description of the peptide sequences (SEQ 
ID NOS:2117, 1916-1917, 1558, 1994, 1960-1961, 2008, 1794, 2015-2016, 
1560 and 2001-2002, respectively in order of appearance). 

Figures 25A-25B: Results of fluorescence polarization assays 
indicating the ability of various monomer and dimer peptides to compete 

30 with fluorescein labeled RP9 monomer peptide for binding to soluble human 
insulin mini-receptor. Figure 25A shows the binding curve. Figure 25B 
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shows the symbol key and description of the peptide sequences (SEQ ID 
NOS:2117, 1916-1917, 1558, 1994, 1960-1961, 2008, 1794, 2015-2016, 
1560, and 2001-2002, respectively in order of appearance). 

Figures 26A-26B: Results of fluorescence polarization assays 
5 indicating the ability of various monomer and dimer peptides to compete 
with fluorescein labeled insulin for binding to soluble human insulin receptor 
ectodomain. Figure 26A shows the binding curve. Figure 26B shows the 
symbol key and description of the peptide sequences (SEQ ID NOS:2117, 
1916-1917, 1558, 1994, 1960-1961, 2008, 1794, 2015-2016, 1560, and 

1 0 2001 -2002, respectively in order of appearance). 

Figures 27A-27B: Results of fluorescence polarization assays 
indicating the ability of various monomer and dimer peptides to compete 
with fluorescein labeled insulin for binding to soluble human insulin mini- 
receptor. Figure 27A shows the binding curve. Figure 27B shows the 

15 symbol key and description of the peptide sequences (SEQ ID NOS:2117, 
1916-1917, 1558, 1994, 1960-1961, 2008, 1794, 2015-2016, 1560, and 
2001-2002, respectively in order of appearance). 

Figure 28: A schematic drawing for the construction of protein 
fusions of the maltose binding protein. 

20 Figure 29: BIAcore analysis of competition binding between IR and 

maltose binding protein fusion peptides H2C-9aa-H2C, H2C, and H2C-3aa- 
H2C. 

Figure 30: Stimulation of IR autophosphorylation in vivo by maltose 
binding protein fusion peptides. 
25 Figures 31 A-31C: Results of free fat cell assays for insulin and Site 

2-Site 1 peptides, S519 and S520. Figure 31 A shows the results for S519. 
Figure 31 B shows the results for S520. Figure 31 C shows the EC 50 values. 

Figures 32A-32B: Results of human insulin receptor kinase assays 
for insulin and Site 2-Site 1 peptides S519 and S520. Figure 32A shows the 
30 substrate phosphorylation curve. Figure 32B shows the calculated Bestfit 
values. 
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Figure 33: Results of in vivo experiments showing the effect of 
intravenous administration of Site 2-Site 1 peptide S519 in Wistar rats: 

Figures 34A-34E: Results of phage competition studies with IGF-1 
peptides RP9 (Site 1) and D8 15 (Site 2). Phage: RP9 (A6-like); RP6 (B6- 
5 like): D8B 12 (Site 2); and D8 15 (Site 2); Peptides: RP9 and D815. Figures 
34A-34B show the competition curves. Figures 34C-34E show the symbol 
keys and peptide groups. 

Figure 35A-35E: Phage competition studies with Site 2-Site 1 
dimer peptides containing 6- or 12-amino acid linkers. Phage: RP9, RP6, 
10 D8B12, and D815; Peptides: D815-6L-RP9 and D815-12L-RP9. Figures 
35A-35B show the competition curves. Figures 35C-35E show the symbol 
keys and peptide groups. 

Figure 36: Results of IGF-1 agonist assay using FDC-P2 cells. Site 1 
peptides RP6, RP9, G33, and Site 2 peptide D815 were tested in the 
1 5 agonist assay. 

Figure 37: Results of IGF-1 antagonist assay using FDCP-2 cells. 
Site 1 peptides RP6, RP9, G33, and Site 2 peptide D815 were tested in the 
antagonist assay. 

Figure 38: Results of IGF-1 agonist assay using FDCP-2 cells. 
20 Site 1 peptides 20E2, S175, and RP9 were tested in the agonist assay. 

Figures 39: Results of agonist and antagonist studies with peptide 
monomers and dimers. Monomers: D815 and RP9; Dimers: D815-6aa- 
RP9 and D815-12aa-RP9. 

Figure 40: Results of agonist and antagonist studies with peptide 
25 monomers and dimers. Monomers: G33 and D815; Dimer: D815-6aa-G33. 

Figure 41 : Results of agonist and antagonist studies with peptide 
monomers and dimers. Monomers: G33, D815 and RP9; Dimers: D815- 
6aa-RP9 and D815-12aa-RP9. 

Figure 42: IGF-1 standard curve using FDC-P2 cells. 
30 Figures 43A-43B: Peptide monomers identified from G33 and RP6 

secondary libraries panned against IGF-1 R (SEQ ID NOS: 1262-1 432). 
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Figure 43A shows peptides from G33 secondary library; Figure 43B shows 
peptides from RP6 secondary library. 

Figures 44A-44B: Peptide dimers identified from libraries panned 
against IR or IGF-1R (SEQ ID NOS:1433-1540). Figure 44A shows dimer 
5 peptides panned against IR; Figure 44B shows dimer peptides panned 
against IGF-1R. 

Figure 45: Results of heterogeneous time-resolved fluorometric 
assays showing the effect of recombinant peptide G33 (rG33) on the binding 
of biotinylated-recombinant human IGF-1 (b-rhlGF-1) to recombinant human 
10 IGF-1 R(rhlGF-1R). 

Figure 46: Results of heterogeneous time-resolved fluorometric 
assays showing the effect of recombinant peptide D815 (rD815) on the 
binding of biotinylated-recombinant human IGF-1 (b-rhlGF-1) to recombinant 
human IGF-1 R (rhlGF-1R). 
15 Figure 47: Results of heterogeneous time-resolved fluorometric 

assays showing the effect of recombinant peptide RP9 on the binding of 
biotinylated-recombinant human IGF-1 (b-rhlGF-1) to recombinant human 
IGF-1 R(rhlGF-1R). 

Figure 48: Results of heterogeneous time-resolved fluorometric 
20 assay showing the effect of recombinant peptide D815-6-G33 on the binding 
of biotinylated-recombinant human IGF-1 (b-rhlGF-1) to recombinant human 
IGF-1 R(rhlGF-1R). 

Figure 49: Results of heterogeneous time-resolved fluorometric 
assays showing the effect of recombinant peptide D815-6-RP9 on the 
25 binding of biotinylated-recombinant human IGF-1 (b-rhlGF-1 ) to recombinant 
human IGF-1 R (rhlGF-1R). 

Figure 50: Results of heterogeneous time-resolved fluorometric 
assays showing the effect of recombinant peptide D815-12-RP9 on the 
binding of biotinylated-recombinant human IGF-1 (b-rhlGF-1) to recombinant 
30 human IGF-1 R (rhlGF-1 R). 
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Figure 51 : Results of heterogeneous time-resolved fluorometric 
assays showing the effect of IGF-1 on the binding of biotinylated- 
recombinant human IGF-1 (b-rhlGF-1) to recombinant human IGF-1 R 
(rhlGF-1R). 

5 Figure 52: Results of time-resolved fluorescence resonance 

energy transfer assays showing the effect of Site 1 peptides, Site 2 
peptides, and rhlGF-1 on the dissociation of biotinylated-20E2 (b-20E2, Site 
1 ) from recombinant human IGF-1 R. 

Figure 53: Results of time-resolved fluorescence resonance 

10 energy transfer assays showing the effect of various peptide monomers and 
dimers on the dissociation of biotinylated-20E2 (b-20E2, Site 1) from 
recombinant human IGF-1 R. 

Figures 54A-54B, 55A-55B, 56A-56B, 57A-57B, 58A-58B, 59A-59B, 
60A-60C, 61 A-61 B, 62A-62B, 63A-63B, and 64A-64B: Amino acid 

15 sequences identified by panning peptide libraries against IGF-1 R. The 
amino acids are represented by their one-letter abbreviation. The ratios 
over background are determined by dividing the signal at 405 nm (E-Tag, 
IGF-1 R, or IR) by the signal at 405 nm for non-fat milk. The IGF-1 R/IR ratio 
comparison is determined by dividing the ratio of IGF-1 R by the ratio of IR. 

20 The IR/IGF-1R ratio comparison is determined by dividing the ratio of IR by 
the ratio of IGF-1 R. Sp/lrr = the ratio of specific binding over non-specific 
binding; LDH = lactate dehydrogenase (negative control). 

Where included, the design of each library is shown in the first line in 
bold. In the design, symbol 'X 1 indicates a random position, an underlined 

25 amino acid indicates a doped position at the nucleotide level, and other 
positions are held constant. Symbols in the listed sequences include: Q 
indicates a position corresponding to a TAG stop codon; # indicates a 
position corresponding to a TAA stop codon; * indicates a position 
corresponding to a TGA stop codon; and ? indicates an unknown amino 

30 acid. The Q residues represent translation read-through at TAG stop 
codons. All libraries were designed with the short FLAG® Epitope DYKD 
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(SEQ ID NO:1545; Hopp et a/., 1988, Bio/Technology 6:1205-1210) at the 
N-terminus of the listed sequence and an E-tag epitope 
(GAPVPYPDPLEPR; SEQ ID NO:XX) at the C-terminus. 

Figures 54A-54B: Peptides identified from a RP6 secondary library 
5 panned against IGF-1R. The RP9 peptide is a Formula 1, Site 1 monomer. 

Figures 55A-55B: Peptides identified from a RP9-NPB25 
secondary library panned against IGF-1R. The RP9-NPB25 peptide is a 
Formula 2, Site 1 monomer with a 25 amino acid C-terminal extension. 

Figures 56A-56B: Peptides identified from a RP30-IGF-NPB20 
10 secondary library panned against IGF-1R. The RP30-IGF-NPB20 peptide is 
a Site 1 , Formula 2 monomer with a 20 amino acid C-terminal extension. 

Figures 57A-57B: Peptides identified from a NPB20-RP30-IGF 
secondary library panned against IGF-1 R. The NPB20-RP30-IGF peptide is 
a Site 1 , Formula 2 monomer with a 20 amino acid N-terminal extension. 
1 5 Figures 58A-58B: Peptides identified from a D81 5 secondary library 

panned against IGF-1 R. The D815 peptide is a Formula 6, Site 2 monomer. 

Figures 59A-59B: Peptides identified from a RP6-D815 secondary 
library panned against IGF-1 R. The RP6-D815 peptide is a Site 1-Site 2 
dimer with no linker. 

20 Figures 60A-60C: Peptides identified from a RP6-6aa-D815 

secondary library panned against IGF-1 R. The RP6-6aa-D815 peptide is a 
Site 1-Site 2 dimer with a 6 amino acid linker. 

Figures 61 A-61B: Peptides identified from a RP6-RP9 secondary 
library panned against IGF-1 R. The RP6-RP9 peptide is a Site 1-Site 1 
25 dimer with no linker. 

Figures 62A-62B: Peptides identified from a RP6-6aa-RP9 
secondary library panned against IGF-1 R. The RP6-6aa-RP9 peptide is a 
Site 1-Site 1 dimer with a 6 amino acid linker. 

Figures 63A-63B: Peptides identified from a D815-RP6 secondary 
30 library panned against IGF-1 R. The D815-RP6 peptide is a Site 2-Site 1 
dimer with no linker. 
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Figures 64A-64B: Peptides identified from a D815-6aa-RP6 
secondary library panned against IGF-1R. The D815-6aa-RP6 peptide is a 
Site 2-Site 1 dimer with a 6 amino acid linker. 

Figures 65A-65F: Dose related increase in cell proliferation of 
5 MiaPaCa and MCF-7 cells as measured in response to IGF-1 , IGF-2, and 
insulin. Cells were treated with either IGF-1 , IGF-2, or insulin. Figure 65A: 
Results for MiaPaCa cells incubated with IGF-1 ; Figure 65B: MiaPaCa cells 
incubated with IGF-2; Figure 65C: MiaPaCa cells incubated with insulin; 
Figure 65D: MCF-7 cells incubated with IGF-1; Figure 65E: MCF-7 cells 

10 incubated with IGF-2; Figure 65F: MCF-7 cells incubated with insulin. 

Figures 66A-66C: Peptide RP33-IGF competes with IGF-1 binding 
for binding to IGF-1 R and antagonizes receptor activity in cell-based assays. 
For competition experiments, the ALPHAScreen assay format was used 
(see below). For antagonism assays, RP33-IGF was added to cells, cells 

15 were incubated with IGF-1, and cell number was determined. Figure 66A: 
Inhibition of IGF-1 binding as a function of RP33-IGF concentration. Figure 
66B: Antagonism of IGF-1 R in MCF-7 cells by peptide RP33-IGF. Figure 
66C: Antagonism of IGF-1 R in MiaPaCa cells by peptide RP33-IGF. 

Figures 67A-67B: IGF-1 stimulates transient phosphorylation of 

20 IRS-1 in MCF-7 cells. Cells were stimulated with IGF-1 for 0, 2, 10, 30, 60 
minutes and total protein was immunoprecipitated for each analysis. Figure 
67A: Western blot analysis of endogenous IRS-1 ; Figure 67B: Western 
blot analysis of phosphorylated IRS-1; Lane 1: No addition; Lane 2: 2 
minute time point; Lane 3: 10 minute time point; Lane 4: 30 minute time 

25 point; Lane 5: 60 minute time point. 

Figures 68A-68B: Phosphorylation of IRS-1 in MCF-7 cells induced 
by IGF-1 is dose-dependant. Cells were exposed to increasing 
concentrations of IGF-1 and total protein was immunoprecipitated. 
Stimulation by 0.50 nM IGF-1 resulted in a sub-maximal level of 

30 phosphorylation that was consistently visualized in Western blot analysis. 
Figure 68A: Western blot analysis of endogenous IRS-1; Figure 68B: 
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Western blot analysis of phosphorylated IRS-1 ; Lane 1 : No addition; Lane 
2: 0.05 nM IGF-1; Lane 3: 0.1 nM IGF-1; Lane 4: 5 nM IGF-1; Lane 5: 1 
nM IGF-1; Lane 6: 0.5 nM IGF-1; Lane 7: 10 nM IGF-1; Lane 8: 50 nM 
IGF-1. 

5 Figures 69A-69B: Blockade of IGF-1-induced phosphorylation of 

IRS-1 in MCF-7 cells by synthetic peptides RP6KK and RP33-IGF. 
Unrelated peptides KCB1 (VSIGECGGLRHHRVRELCLV; SEQ ID NO:XX) 
and DGI3-D1 (ECRWFRPWRCPGLLSTGGGR; SEQ ID NO:XX) were used 
as negative controls. Figure 69A: Western blot analysis of expressed IRS- 

10 1; Figure 69B: Western blot analysis of phosphorylated IRS-1. Lane 1: no 
addition; Lane 2: DGI3-D1; Lane 3: KCB1; Lane 4: IGF-1 plus DGI3-D1; 
Lane 5: IGF-1 plus KCB1; Lane 6: IGF-1 plus RP6KK; Lane 7: IGF-1 plus 
RP33-IGF; Lane 8: IGF-1. 

Figures 70A-70C: Peptides RP54 and RP52 compete with IGF-1 for 

15 binding to IGF-1 R, and act as antagonists in cell proliferation assays. For 
antagonism assays, RP54 or RP52 was added to cells, cells were incubated 
with IGF-1, and cell number was determined. Figure 70A: Antagonism of 
IGF-1 R by RP54 in MCF-7 cells; Figure 70B: Antagonism of RP54 in 
MiaPaCa cells. Figure 70C: Antagonism of IGF-1 by RP52 in MCF-7 cells. 

20 Figures 71 A-71F: Peptide monomers with IGF-1R agonist or 

antagonist activity in MCF-7 or MiaPaCa cell proliferation assays compete 
against IGF-1 for binding to IGF-1 R. Potencies of peptide competition were 
determined using the AlphaScreen assay format (see below). Figure 71A: 
RP60 peptide; Figure 71 B: RP48 peptide; Figure 71 C: sG33 peptide; 

25 Figure 71 D: C1 peptide; Figure 71 E: L-RP9ex peptide; Figure 71 F: 12- 
RP9ex peptide. 

Figures 72A-72E: Peptide dimers with IGF-1 R agonist activity in 
MCF-7 or MiaPaCa cell proliferation assays compete with IGF-1 for binding 
to IGF-1 R. Potencies of peptide competition were determined using the 
30 AlphaScreen assay format (see below). Figure 72A: rRP30-IGF-12-D112 
peptide (Site 1-Site 1); Figure 72B: rRP30-IGF-12-RP31-IGF peptide (Site 
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1-Site 2); Figure 72C: rRP31 -IGF-1 2-RP30-IGF peptide (Site 2-Site 1); 
Figure 72D: rD1 12-12-RP30-IGF peptide (Site 1-Site 1); Figure 72E: 
rD112-12-D112 peptide (Site 1-Site 1). 

Figures 73A-73D: Peptide monomers with IGF-1 R agonist activity in 
5 MCF-7 or MiaPaCa cell proliferation assays. Figure 73A: RP60 peptide; 
Figure 73B: RP48 peptide; Figure 73C: G33 peptide; Figure 73D: L-RP9ex 
peptide. 

Figures 74A-74I: Peptide dimers with IGF-1 R agonist activity in 
MCF-7 or MiaPaCa cell proliferation assays. Figure 74A: RP30-IGF-12- 

10 D112 (Site 1-Site 1); Figure 74B: RP30-IGF-12-RP31-IGF (Site 1-Site 2); 
Figure 74C: RP31 -IGF-1 2-RP30-IGF (Site 2-Site 1); Figure 74D: D1 12-12- 
RP30-IGF (Site 1-Site 1); Figure 74E: RP6-L-D8B12 (Site 1-Site 2); Figure 
74F: D8B12-12-RP9 (Site 2-Site 1); Figure 74G: D112-12-D112 (Site 1- 
Site 1); Figure 74H: RP9-12-RP9 (Site 1-Site 1); Figure 74I: RP9-L-RP6 

15 (Site 1-Site 1). 

V. DETAILED DESCRIPTION OF THE INVENTION 

This invention relates to amino acid sequences comprising motifs that 
bind to the insulin receptor (IR) and/or insulin-like growth factor receptor 
(IGF-1 R). In addition to binding to IR and/or IGF-1 R, the amino acid 
20 sequences also possess either agonist, partial agonist or antagonist activity 
at IR or IGF-1 R. In addition, the amino acid sequences bind to separate 
binding sites (Sites 1 or 2) on IR or IGF-1 R. 

Although capable of binding to IR or IGF-1 R at sites which participate 
in conferring agonist or antagonist activity, the amino acid sequences are 
25 not based on the native insulin or IGF-1 sequences, nor do they reflect an 
obvious homology to any such sequences. 

The amino acid sequences of the invention may be peptides, 
polypeptides, or proteins. These terms as used herein should not be 
considered limiting with respect to the size of the various amino acid 
30 sequences referred to herein and which are encompassed within this 
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invention. Thus, any amino acid sequence comprising at least one of the IR 
or IGF-1R binding motifs disclosed herein, and which binds to IR or IGF-1R 
is within the scope of this invention. In preferred embodiments, the amino 
acid sequences confer insulin or IGF-1 agonist or antagonist activity. The 
5 amino acid sequences of the invention are typically artificial, i.e., non- 
naturally occurring peptides, polypeptides, or fragments thereof. The amino 
acid sequences of the invention do not include insulin, insulin-like growth 
factors, antibodies against insulin receptors or insulin-like growth factor 
receptors, or fragments thereof. Amino acid sequences useful in the 

10 invention may be obtained through various means such as chemical 
synthesis, phage display, cleavage of proteins or polypeptides into 
fragments, or by any means which amino acid sequences of sufficient length 
to possess binding ability may be made or obtained. 

The amino acid sequences provided by this invention should have an 

1 5 affinity for IR sufficient to provide adequate binding for the intended purpose. 
Thus, for use as a therapeutic, the peptide, polypeptide, or protein provided 
by this invention should have an affinity (K d ) of between about 10" 7 to about 
10" 15 M. More preferably the affinity is 10~ 8 to about 10~ 12 M. Most 
preferably, the affinity is 10' 10 to about 10" 12 M. For use as a reagent in a 

20 competitive binding assay to identify other ligands, the amino acid sequence 
preferably has affinity for the receptor of between about 10~ 5 to about 10" 12 
M. 

The present invention describes several different binding motifs, 
which bind to active sites on IR or IGF-1 R. The binding motifs are defined 

25 based on the analysis of several different amino acid sequences and 
analyzing the frequency that particular amino acids or types of amino acids 
occur at a particular position of the amino acid sequence as described in the 
related applications of Beasley et aL International Application 
PCT/US00/08528, filed March 29, 2000, and Beasley et aL, U.S. Application 

30 Serial No. 09/538,038, filed March 29, 2000. 
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Also included within the scope of this invention are amino acid 
sequences containing substitutions, additions, or deletions based on the 
teachings disclosed herein and which bind to IR or IGF-1R with the same or 
altered affinity. For example, sequence tags (e.g., FLAG® tags) or amino 
5 acids, such as one or more lysines, can be added to the peptide sequences 
of the invention (e.g., at the N-terminal or C-terminal ends) as described in 
detail herein. Sequence tags can be used for peptide purification or 
localization. Lysines can be used to increase peptide solubility or to allow 
for biotinylation. Alternatively, amino acid residues located at the carboxy 

10 and amino terminal regions of the consensus motifs described below, which 
comprise sequence tags (e.g., FLAG® tags), or which contain amino acid 
residues that are not associated with a strong preference for a particular 
amino acid, may optionally be deleted providing for truncated sequences. 
Certain amino acids (e.g., C-terminal or N-terminal residues) such as lysine 

15 which promote the stability or biotinylation of the amino acids sequences 
may be deleted depending on the use of the sequence, as for example, 
expression of the sequence as part of a larger sequence which is soluble, or 
linked to a solid support. 

Peptides that bind to IR or IGF-1R, and methods and kits for 

20 identifying such peptides, have been disclosed by Beasley et a/., 
International Application PCT/US00/08528 filed March 29, 2000 and 
Beasley et a/., U.S. Application Serial No. 09/538,038 filed March 29, 2000, 
which are incorporated by reference in their entirety. 

A. Consensus Motifs 

25 The following motifs have been identified as conferring binding 

activity to IR and/or IGF-1 R: 

1 . X 1 X 2 X 3 X4X5 (Formula 1 ; Group 1 ; the A6 motif) wherein X 1f X 2 , 
X4 and X 5 are aromatic amino acids, preferably, phenylalanine or tyrosine. 
Most preferably, X1 and X 5 are phenylalanine and X 2 is tyrosine. X 3 may be 
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any small polar amino acid, but is preferably selected from aspartic acid, 
glutamic acid, glycine, or serine, and is most preferably aspartic acid or 
glutamic acid. X4 is most preferably tryptophan, tyrosine, or phenylalanine 
and most preferably tryptophan. Particularly preferred embodiments of the 
5 A6 motif are FYDWF (SEQ ID NO: 1554) and FYEWF (SEQ ID NO: 1555). 
The A6 motif possesses agonist activity at IGF-1R, but agonist or antagonist 
activity at IR depending on the identity of amino acids flanking A6. See 
Figure 5A. 

Amino acid sequences that comprise the A6 motif and possess 
10 agonist activity at IR, include but are not limited to, D117/H2C: 
FHENFYDWFVRQVSKK (SEQ ID NO:1556); D117/H2 minus terminal 
lysines: FHENFYDWFVRQVS (SEQ ID NO:1557); RP9: 

GSLDESFYDWFERQLGKK (SEQ ID NO:1558); RP9 minus terminal 
lysines: GSLDESFYDWFERQLG (SEQ ID NO: 1559); and S175: 
15 GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560). Preferred RP9 
sequences include GLADEDFYEWFERQLR (SEQ ID NO:1561), 
GLADELFYEWFDRQLS (SEQ ID NO:1562), GQLDEDFYEWFDRQLS 
(SEQ ID NO:1563), GQLDEDFYAWFDRQLS (SEQ ID NO:1564), 
GFMDESFYEWFERQLR (SEQ ID NO: 1565), GFWDESFYAWFERQLR 
20 (SEQ ID NO:1566), GFMDESFYAWFERQLR (SEQ ID NO: 1567), and 
GFWDESFYEWFERQLR (SEQ ID NO:1568). Non-limiting examples of 
Group 1 (Formula 1; A6) amino acid sequences are shown in Figures 1A- 
10. 

2. XeX/XsXgX^XuX^X^ (Formula 2, Group 3; the B6 motif) 
25 wherein X 6 and X 7 are aromatic amino acids, preferably, phenylalanine or 
tyrosine. Most preferably, X 6 is phenylalanine and X 7 is tyrosine. Xs, X 9 , X 11t 
and X 12 may be any amino acid. X10 and X 13 are hydrophobic amino acids, 
preferably leucine, isoleucine, phenylalanine, tryptophan or methionine, but 
more preferably leucine or isoleucine. X 10 is most preferably isoleucine for 
30 binding to IR and leucine for binding to IGF-1R. X13 is most preferably 
leucine. Amino acid sequences of Formula 2 may function as an antagonist 
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at the IGF-1R, or as an agonist at the IR. Preferred consensus sequences 
of the Formula 2 motif are FYXaXgLXnX^L (SEQ ID NO: 1569), 
FYXsXqIXuX^L (SEQ ID NO:1570), FYXsAIXuX^L (SEQ ID NO:1571), and 
FYX 8 YFXnX 12 L (SEQ ID NO:1572). 
5 Another Formula 2 motif for use with this invention comprises FYXeY 

FX11X12L (SEQ ID NO:1573) and is shown as Formula 2A ("NNRP") below: 
X115X116X117X118FYX8YFX11X12LX119X120X121X122 (SEQ ID NO:1574) wherein 
Xn5-Xn 8 and X n8 -X 12 2 may be any amino acid which allows for binding to IR 
or IGF-1R. X115 is preferably selected from the group consisting of 

10 tryptophan, glycine, aspartic acid, glutamic acid, and arginine. Aspartic acid, 
glutamic acid, glycine, and arginine are more preferred. Tryptophan is most 
preferred. The preference for tryptophan is based on its presence in clones 
at a frequency three to five fold higher than that expected over chance for a 
random substitution, whereas aspartic acid, glutamic acid and arginine are 

1 5 present about two fold over the frequency expected for random substitution. 

Xi 16 preferably is an amino acid selected from the group consisting of 
aspartic acid, histidine, glycine, and asparagine. X 117 and Xn 8 are 
preferably glycine, aspartic acid, glutamic acid, asparagine, or alanine. 
More preferably Xn 7 is glycine, aspartic acid, glutamic acid and asparagine 

20 whereas X 118 is more preferably glycine, aspartic acid, glutamic acid or 
alanine. X 8 when present in the Formula 2A motif is preferably arginine, 
glycine, glutamic acid, or serine. Xn when present in the Formula 2A motif 
is preferably glutamic acid, asparagine, glutamine, or tryptophan, but most 
preferably glutamic acid. X 12 when present in the Formula 2A motif is 

25 preferably aspartic acid, glutamic acid, glycine, lysine or glutamine, but most 
preferably aspartic acid. X 119 is preferably glutamic acid, glycine, glutamine, 
aspartic acid or alanine, but most preferably glutamic acid. X120 is preferably 
glutamic acid, aspartic acid, glycine or glutamine, but most preferably 
glutamic acid. X 12 i is preferably tryptophan, tyrosine, glutamic acid, 

30 phenylalanine, histidine, or aspartic acid, but most preferably tryptophan or 
tyrosine. X122 is preferably glutamic acid, aspartic acid or glycine; but most 
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preferably glutamic acid. Preferred amino acid residue are identified based 
on their frequency in clones over two fold over that expected for a random 
event, whereas the more preferred sequences occur about 3-5 times as 
frequently as expected. 
5 In certain cases, Formula 1 and Formula 2 amino acid sequences 

may also include two cysteine residues, which may be positioned either 
outside or inside the motif sequence (e.g., Xi X 2 X3 X4 X 5 and 
X6X7X8X9X10X11X12X13), as described herein. The spacing between the 
cysteine residues preferably may vary from 3 amino acids, e.g., RP62 

10 (CDFYCALSRLSGQPRDRMPNYPGTS; SEQ ID NO:XX) up to 19 amino 
acids, e.g., RP35 (DRDFCRFYERLTALVGGQVDGGWPC; SEQ ID NO:XX). 
Formula 1 and Formula 2 peptides may exhibit varying size and cysteine 
positioning. For example, Formula 2 peptide RP6 

(TF YS CL AS LLTGTPQ P N RG P W E RC R ; SEQ ID NO:XX) and derivatives, 

15 RP30-IGF, RP33-IGF, include two cysteine residues separated by 18 amino 
acids. In contrast, Formula 1 peptide G33 

(GIISQSCPESFYDWFAGQVSDPWWCW; SEQ ID NO:XX) includes two 
cysteines separated by 17 amino acid residues. In certain Formula and 
Formula 2 peptides, the position and spacing of the cysteine residues was 

20 found to be highly preferred in these peptides as determined by calculations 
of amino acid preferences from peptides obtained by biopanning of RP6 and 
G33 secondary libraries. Without wishing to be bound by theory, it is 
possible that the cysteine pairs observed in Formula 1 and Formula 2 amino 
acid sequences form cysteine loop structures. 

25 3. Xi 4 Xi5Xi6X 1 7Xi8X 1 9X 2 oX2i (Formula 3, reverse B6, revB6), 

wherein X 14 and X 17 are hydrophobic amino acids; X 14 , X 17 are preferably 
leucine, isoleucine, and valine, but most preferably leucine; Xi 5i X 16t X 18 and 
X19 may be any amino acid; X 2 o is an aromatic amino acid, preferably 
tyrosine or histidine, but most preferably tyrosine; and X21 is an aromatic 

30 amino acid, but preferably phenylalanine or tyrosine, and most preferably 
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phenylalanine. For use as an IGF-1R binding ligand, an aromatic amino 
acid is strongly preferred at X18. 

4. X22X23X24X25X26X27X28X29X30X31X32X33X34X35X36X37X38X39X40 
X41 (Formula 4; Group 7; the F8 motif) wherein X22, X 25 , X 26 , X 28 , X 29 , X 30 , 
5 X33, X34, X35, X36, X37, X 38> X40, and X41 are any amino acid. X 35 and X37 may 
be any amino acid when the F8 motif is used as an IR binding ligand or as a 
component of an IR binding ligand, however for use as an IGF-1R binding 
ligand, glycine is strongly preferred at X37 and a hydrophobic amino acid, 
particularly, leucine, is preferred at X35 X 23 is a hydrophobic amino acid. 

10 Methionine, valine, leucine or isoleucine are preferred amino acids for X23, 
however, leucine which is most preferred for preparation of an IGF-1R 
binding ligand is especially preferred for preparation of an IR binding ligand. 
At least one cysteine is located at X24 through X 2 7, and one at X 39 or X40. 
Together the cysteines are capable of forming a cysteine cross-link to create 

15 a looped amino acid sequence. In addition, although a spacing of 14 amino 
acids in between the two cysteine residues is preferred, other spacings may 
also be used provided binding to IGF-1R or IR is maintained. Accordingly, 
other amino acids may be substituted for the cysteines at positions X 2 4 and 
X 39 if the cysteines occupy other positions. 

20 In one embodiment, for example, the cysteine at position X 2 4 may 

occur at position X 2 7 which will produce a smaller loop provided that the 
cysteine is maintained at position X 39 . These smaller looped peptides are 
described herein as Formula 5, infra. X 2 7 is any polar amino acid, but is 
preferably selected from glutamic acid, glutamine, aspartic acid, asparagine, 

25 or as discussed above cysteine. The presence of glutamic acid at position 
X27 decreases binding to IR but has less of an effect on binding to IGF-1R. 
X31 is any aromatic amino acid and X32 is any small amino acid. For binding 
to IGF-1R, glycine or serine is preferred at position X31, however, tryptophan 
is highly preferred for binding to IR. At position X 32 , glycine is preferred for 

30 both IGF-1R and IR binding. X 36 is an aromatic amino acid. A preferred 
consensus sequence for F8 is X 2 2LCX25X26EX 28 X29X3oWGX33X34X3 5 
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X36X37X38CX40X41 (SEQ ID NO:1575) whereas the amino acids are defined 
above. A more preferred F8 sequence is HLCVLEELFWGASLFGYCSG 
("F8"; SEQ ID NO: 1576). Amino acid sequences comprising the F8 
sequence motif preferably bind to IR over IGF-1R. Figures 2A-2E list non- 
5 limiting examples of Formula 4 amino acid sequences. 
5. 

X42X43X44X45X46X47X48X49X50X51X52X53X54X55X56X57X58X59X60X61 
(Formula 5; mini F8 motif) wherein X42, X43, X44, X45, X 53 , X 55 , X 56 , X 58 , Xeo, 
and X 6 i are any amino acid. X43, X46, X49, X 50 , and X54 are hydrophobic 

10 amino acids, however, X43 and X46 are preferably leucine, whereas X 50 is 
preferably phenylalanine or tyrosine but most preferably phenylalanine. X47 
and X59 are cysteines. X48 is preferably a polar amino acid, i.e., aspartic 
acid or glutamic acid, but most preferably glutamic acid. Use of the small 
amino acid at position 54 may confer IGF-1R specificity. X51, X 52| and X 57 

15 are small amino acids, preferably glycine. A preferred consensus sequence 
for mini F8 is X42X43X44X45LCEX49FGGX53X54X55X56G X 58 CX 6 oX6i (SEQ ID 
NO:1577). Amino acid sequences comprising the sequence of Formula 5 
preferably bind to IGF-1R or IR. 

6 . X62X63X64X65X66X67X68X69X70X7 1 X72X73X74X75X76X77X78X79X80 

20 X 81 (Formula 6; Group 2; the D8 motif) wherein X 62 , X 65 , X 68 , X 69 , X71, X 73 , 
X 76t X77, X78, X 80 , and X 8 i may be any amino acid. X66 may also be any 
amino acid, however, there is a strong preference for glutamic acid. 
Substitution of X 6 e with glutamine or valine may result in attenuation of 
binding. X 6 3, X 7 o, and X74 are hydrophobic amino acids. X 6 3 is preferably 

25 leucine, isoleucine, methionine, or valine, but most preferably leucine. X 70 
and X74 are preferably valine, isoleucine, leucine, or methionine. X 74 is most 
preferably valine. X64 is a polar amino acid, more preferably aspartic acid or 
glutamic acid, and most preferably glutamic acid. X 6 7 and X75 are aromatic 
amino acids. Whereas tryptophan is highly preferred at X 6 7, X 75 is preferably 

30 tyrosine or tryptophan but most preferably tyrosine. X72 and X 79 are 
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cysteines that again are believed to form a loop which position amino acid 
may be altered by shifting the cysteines in the amino acid sequence. 

D8 is most useful as an amino acid sequence having a preference for 
binding to IR as only a few D8 sequences capable of binding to IGF-1R over 
5 background have been detected. A preferred sequence for binding to IR is 

X62LX64X65X66WX 68 X 6 9X7oX7lCX73X74X75X76X77X78CX8oX8i (SEQ ID 

NO: 1578). Examples of specific peptide sequences comprising this motif 
include D8: KWLDQEWAWVQCEVYGRGCPSKK (SEQ ID NO:1579); and 
D8 minus terminal lysines: KWLDQEWAWVQCEVYGRGCPS (SEQ ID 

10 NO: 1580). Preferred D8 monomer sequences include 

SLEEEWAQIQCEIYGRGCRY (SEQ ID NO:1581) and 
SLEEEWAQIQCEIWGRGCRY (SEQ ID NO:1582). Preferred D8 dimer 
sequences include SLEEEWAQIECEVYGRGCPS (SEQ ID NO: 1583), and 
SLEEEWAQIECEVWGRGCPS (SEQ ID NO:1584). Non-limiting examples 

15 of Group 2 (Formula 6; D8) amino acid sequences are shown in Figures 3A- 
3E. 

7. HX82X83X84X85X86X87X88X89X 9 oX 91 X92 (Formula 7) wherein Xs2 
is proline or alanine but most preferably proline; Xe3 is a small amino acid 
more preferably proline, serine or threonine and most preferably proline; Xs4 

20 is selected from leucine, serine or threonine but most preferably leucine; X 8 s 
is a polar amino acid preferably glutamic acid, serine, lysine or asparagine 
but more preferably serine; X 8 6 may be any amino acid but is preferably a 
polar amino acid such as histidine, glutamic acid, aspartic acid, or 
glutamine; X 8 7 is an aliphatic amino acid preferably leucine, methionine or 

25 isoleucine and most preferably leucine; amino acid X 88 , X 89 and X 90 may be 
any amino acids; X 9 i is an aliphatic amino acid with a strong preference for 
leucine as is X 92 . Phenylalanine may also be used at position 92. A 
preferred consensus sequence of Formula 7 is HPPLSXseLXesXsgXgoLL 
(SEQ ID NO: 1585). The Formula 7 motif binds to IR with little or no binding 

30 tolGF-1R. 
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8. Another sequence is XKHXiosXioeXioyXiosXiosXnoXmXnaXna 
Xi 14 (Formula 8) which comprises eleven amino acids wherein at least one, 
and preferably two of the amino acids of X i0 6 through Xm are tryptophan. 
In addition, it is also preferred that when two tryptophan amino acids are 

5 present in the sequence they are separated by three amino acids, which are 
preferably, in sequential order proline, threonine and tyrosine with proline 
being adjacent to the tryptophan at the amino terminal end. Accordingly, the 
most preferred sequence for Xi 07X103X1 09X1 i 0 Xm is WPTYW (SEQ ID 
NO: 1586). At least one of the three amino acids on the amino terminal 

10 (X 10 4, X105, Xioe) and at least one of the amino acids carboxy terminal (X 1i2 , 
X113, X 114 ) ends immediately flanking Xi 0 7-Xm are preferably a cysteine 
residue, most preferably at X 10 5 and X 1i3 respectively. Without being bound 
by theory, the cysteines are preferably spaced so as to allow for the 
formation of a loop structure. X104 and Xn 4 are both small amino acids such 

15 as, for example, alanine and glycine. Most preferably, X i0 4 is alanine and 
X114 is glycine. X105 may be any amino acid but is preferably valine. Xn 2 is 
preferably asparagine. Thus, the most preferred sequence is 
ACVWPTYWNCG (SEQ ID NO:1587). 

9. An amino acid sequence comprising JBA5: 
20 DYKDLCQSWGVRIGWLAGLCPKK (SEQ ID NO: 1541); or JBA5 without 

terminal lysines: LCQSWGVRIGWLAGLCP (SEQ ID NO:1542) (Formula 9). 
The Formula 9 motif is another motif believed to form a cysteine loop that 
possesses agonist activity at both IR and IGF-1R. Although IR binding is 
not detectable by ELISA, binding of Formula 9 to I R is competed by insulin 
25 and is agonistic. 

10. W X 123 G Y X 124 W X 125 X 126 (SEQ ID NO:1543) (Formula 10; 
Group 6) wherein X i23 is selected from proline, glycine, serine, arginine, 
alanine or leucine, but more preferably proline; X i24 is any amino acid, but 
preferably a charged or aromatic amino acid; X i25 is a hydrophobic amino 

30 acid preferably leucine or phenylalanine, and most preferably leucine. Xi 26 
is any amino acid, but preferably a small amino acid. In one embodiment of 
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the present invention, the Formula 10, Group 6 motif is WPGY (SEQ ID 
NO:1588). Examples of specific peptide sequences comprising this motif 
include E8: KVRGFQGGTVWPGYEWLRNAAKK (SEQ ID NO:1589); and 
E8 minus terminal lysines: KVRGFQGGTVWPGYEWLRNAA (SEQ ID 
5 NO:1590). Preferred Group 6 sequences include WAGYEWF (SEQ ID 
NO:1591), WEGYEWL (SEQ ID NO:1592), WAGYEWL (SEQ ID NO:1593), 
WEGYEWF (SEQ ID NO:1594), and DSDWAGYEWFEEQLD (SEQ ID 
NO: 1595). Non-limiting examples of Group 6 amino acid sequences are 
shown in Figures 4A-4B. 

10 The IR and IGF-1R binding activities of representative Group 1 

(Formula 1; A6); Group 2 (Formula 6; D8); and Group 6 (Formula 10); and 
Group 7 (Formula 4; F8) amino acid sequences are summarized in Figures 
8 and 9A-9B. Group 1 (Formula 1; A6) amino acid sequences contain the 
consensus sequence FyxWF (SEQ ID NO:1596), which is typically agonistic 

15 in cell-based assays. Group 2 (Formula 6; D8) amino acid sequences are 
composed of two internal sequences having a consensus sequence VYGR 
(SEQ ID NO: 1597) and two cysteine residues each. Thus, Group 2 peptides 
are capable of forming a cyclic peptide bridged with a disulfide bond. 
Neither of these consensus sequences have any significant linear sequence 

20 similarities greater than 2 or 3 amino acids with mature insulin. Group 7 
(Formula 4; F8) amino acid sequences are composed of two internal 
exemplary sequences which do not have any significant sequence 
homology, but have two cysteine residues 13-14 residues apart, thus being 
capable of forming a cyclic peptide with a long loop anchored by a disulfide 

25 bridge. 

B. Amino And Carboxyl Terminal Extensions Modulate 
Activity of Motifs 

In addition to the motifs stated above, the invention also provides 
preferred sequences at the amino terminal or carboxyl terminal ends which 
30 are capable of enhancing binding of the motifs to either IR, IGF-1R, or both. 
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In addition, the use of the extensions described below does not preclude the 
possible use of the motifs with other substitutions, additions or deletions that 
allow for binding to IR, IGF-1 R, or both. 

1. Formula 1 

Any amino acid sequence may be used for extensions of the amino 
terminal end of A6, although certain amino acids in amino terminal 
extensions may be identified which modulate activity. Preferred carboxy 
terminal extensions for A6 are A6-X93X94X95X96X97 wherein X 93 may be any 
amino acid, but is preferably selected from the group consisting of alanine, 
valine, aspartic acid, glutamic acid, and arginine, and X94 and X 97 are any 
amino acid; X 95 is preferably glutamine, glutamic acid, alanine or lysine but 
most preferably glutamine. The presence of glutamic acid at X 95 however 
may confer some IR selectivity. Further, the failure to obtain sequences 
having an asparagine or aspartic acid at position X 95 may indicate that these 
amino acids should be avoided to maintain or enhance sufficient binding to 
IR and IGF-1 R. X 96 is preferably a hydrophobic or aliphatic amino acid, 
more preferably leucine, isoleucine, valine, or tryptophan but most 
preferably leucine. Hydrophobic residues, especially tryptophan at X 96 may 
be used to enhance IR selectivity. 

20 2. Formula 2 

B6 with amino terminal and carboxy terminal extensions may be 
represented as X98X99-B6-X100. X 98 is optionally aspartic acid and X 99 is 
independently an amino acid selected from the group consisting of glycine, 
glutamine, and proline. The presence of an aspartic acid at X 98 and a 
25 proline at X 99 is associated with an enhancement of binding for both IR and 
IGF-1 R. A hydrophobic amino acid is preferred for the amino acid at X100, an 
aliphatic amino acid is more preferred. Most preferably leucine, for IR and 
valine for IGF-1 R. Negatively charged amino acids are preferred at both the 
amino and carboxy terminals of Formula 2A. 



10 
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3. Formula 3 

An amino terminal extension of Formula 3 defined as Xi 0 iX 10 2Xi 0 3- 
revB6 wherein X103 is a hydrophobic amino acid, preferably leucine, 
isoleucine or valine, and X 10 2 and X i0 i are preferably polar amino acids, 
5 more preferably aspartic acid or glutamic acid may be useful for enhancing 
binding to IR and IGF-1R. No preference is apparent for the amino acids at 
the carboxy terminal end of Formula 3. 

4. Formula 10 

In one preferred embodiment, Formula 10 sequences 
10 WX^aGYX^WX^sX^e (SEQ ID NO:1543) can include an amino terminal 
extension comprising the sequence DSD and/or a carboxy terminal 
extension comprising the sequence EQLD (SEQ ID NO:1598). 

C. IR Binding Preferences 

As indicated above, the amino acid sequences containing the motifs 
15 of this invention may be constructed to have enhanced selectivity for either 
IR or IGF-1R by choosing appropriate amino acids at specific positions of 
the motifs or the regions flanking them. By providing amino acid 
preferences for IR or IGF-1 R, this invention provides the means for 
constructing amino acid sequences with minimized activity at the non- 
20 cognate receptor. For example, the amino acid sequences disclosed herein 
with high affinity and activity for IR and low affinity and activity for IGF-1R 
are desirable as IR agonist as their propensity to promote undesirable cell 
proliferation, an activity of IGF-1 agonists, is reduced. Ratios of IR binding 
affinity to IGF-1 R binding affinity for specific sequences are provided in 
25 Figures 1A-10; 2A-2E; 3A-3E; 4A-4I; 44A-44B. As an insulin therapeutic, 
the IR/IGF-1R binding affinity ratio is preferably greater than 100. 
Conversely, for use as an IGF-1 R therapeutic, the IR/IGF-1R ratio should be 
less than 0.01. Examples of peptides that selectively bind to IGF-1 R are 
shown below. 
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Besides relative binding at IR or IGF-1R, relative efficacy at the 
cognate receptor is another important consideration for choosing a potential 
therapeutic. Thus, a sequence that is efficacious at IR but has little or no 
significant activity at IGF-1R may also be considered as an important IR 
5 therapeutic, irrespective of the relative binding affinities at IR and IGF-1R. 
For example, A6 selectivity for IR may be enhanced by including glutamic 
acid in a carboxyl terminal extension at position X 95 . IR selectivity of the B6 
motif may be enhanced by having a tryptophan or phenylalanine at Xn. 
Tryptophan at X13 also favors selectivity of IR. A tryptophan amino acid at 

10 X13 rather than leucine at that position also may be used to enhance 
selectivity for IR. In the reverse B6 motif, a large amino acid at X 15 favors IR 
selectivity. Conversely, small amino acids may confer specificity for IGF-1R. 
In the F8 motif, an L in position X 2 3 is essentially required for IR binding. In 
addition, tryptophan at X 3 i is also highly preferred. At X32, glycine is 

1 5 preferred for IR selectivity. 

D. Multiple Binding Sites On IR And IGF-1R 

The competition data disclosed herein reveals that at least two 
separate binding sites are present on IR and IGF-1R which recognize the 
different sequence motifs provided by this invention. 

20 As shown in Figure 6, competition data indicate that peptides 

comprising the A6 motifs compete for binding to the same site on IR (Site 1) 
whereas the D8 motifs compete for a second site (Site 2). The identification 
of peptides that bind to separate binding sites on IR and IGF-1R provides for 
various schemes of binding to IR or IGF-1R to increase or decrease its 

25 activity. Examples of such schemes for IR are illustrated in Figure 7. 

The table below shows sequences based on their groups, which bind 
to Site 1 or Site 2. 
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TABLE 2 



REPRESENTATIVE SITE 1 PEPTIDES 



A6-like (FYxWF) (SEQ ID NO:1596): 



Clone 


Sequence 


SEQ ID NO: 


G3 


KRGGGTFYEWFESALRKHGAGKK 


1718 


H2 


VTFTSAVFHENFYDWFVRQVS KK 


1719 


H2C 


FHENFYDWFVRQVSKK 


1556 


A6S-IR3-E12 


G R VDWL QRNANFYD WFVAE L G 


1560 


A6S-IR4-G1 


NGVERAGTGDNFYDWFVAQLH 


1720 


H2CB-R3-B12 


QSDSGTVHDRFYGWFRDTWAS 


1721 


20E2A-R3-B11 


GRFYGWFQDAI DQLMPWGFDP 


1722 


rB6-F6 


RYGRWGLAQQFYDWFDR 


1723 


E4Dct-l-B8-IR~ 


G FRE GQRW YWFVAQVT 


1724 


H2CA-4-F11-IR 


TYKARFLHENFYDWFNRQVSQYFGRV 


1725 


H2CB-R3-D2 


WTDVDGFHSGFYRWFQNQWER 


1726 


H2CB-R3-D12 


VASGHVLHGQFYRWFVDQFAL 


1727 


H2CB-R4-H5 


QARVGNVHQQFYEWFREVMQG 


1728 


H2C-B-E8* 


TGHRLGLDEQFYWWFRDALSG 


1729 


H2CB-3-B6-IR- 


VGDFCVSHDCFYGWFLRESMQ 


1730 


A6S-IR2-C1 


RMYFSTGAPQNFYDWFVQEWD 


1731 



B6-like (FYxxLxxL) (SEQ ID NO:1732): 

Clone Sequence 

25 20C11 KDRAFYNGLRDLVGAVYGAWDKK 1733 

20E2 DYKDFYD AIDQLVRG S ARAGGTRDKK 1734 

B62-R3-C7 EHWNTVDPFYFTLFEWLRESG 1735 

B62-R3-C10 EHWNTVDPFYQYFSELLRESG 1736 

30 20E2B-3-B3-IR AGVNAGFYRYFSTLLDWWDQG 1737 

2 0E2 - B- E3 * IQGWEPFYGWFDDWAQMFEE 173 8 

20E2A-R4 -F9 PPWGARFYDAI EQLVFDNLCC 173 9 

RPNN - 4 - G6 - HOLO * RWPNFYG YFES LLTHFS 174 0 

RPNN-4-F3-HOLO* HYNAFYEYFQVLLAETW 1741 

35 20E2A-R4-E2 IGRVRS FYDAI DKLFQSDWER 1742 

RPNN-2-C1-IR* EGWDFYSYFSGLLASVT 1743 

20E2B-4-F12-IR SVKEVQ FYRYF YDLLQS EESG 1744 

20E2-B-E12 GNSGGS FYRYFQLLLDSDGMS 174 5 

20E2A-R3-B6 RDAGSS FYDAI DQLVCLT YFC 174 6 

40 

Reverse B6-like (LxxLxxYF) (SEQ ID NO:1747): 

Clone Sequence 

rB6-A12 LDALDRLMRYFEERPSL 1748 

rB6-F9 PLAELWAYFEHSEQGRSSAH 1749 

TB6-4-E7-IR LDPLDALLQYFWSVPGH 1750 

rB6-4-F9-IR RGRLGSLSTQFYNWFAE 1751 

rB6-E6 ADELEWLLDYFMHQPRP 17 52 

rB6-4-F12-IR DGVLEELFSYFSATVGP 1753 

Group 6 (WPxYxWL) (SEQ ID NO:1754): 

Clone Sequence 

R20p-4-A4-IR WPGYLFFEEALQDWRGSTED 1755 

55 Peptides by design**: 

Clone Sequence 

H2C-PD1-IR- AAVHEQFYDWFADQYKK 1756 

A6S-PD1-IR- QAPSNFYDWFVREWDKK 175 7 



45 



50 



WO 03/027246 



PCT/US02/30412 



-49- 

20E2-PD1-IR- QSFYDYIEELLGGEWKK 1758 

B6C-PD1-IR- DPFYQGLWEWLRESGKK 1759 



5 REPRESENTATIVE SITE 2 PEPTIDES (C-C LOOPS) 



F8-derived (Long C-C loop): 



10 



15 



20 



25 



30 



35 



40 



45 



Clone 
F8 

F8-C12 
F8-Des2 

F8-F12 

F8-B9 

F8-B12 

NNKH-2B3 

NNKH-2F9- 

NNKH-4H4- 



Sequence 

HLCVLEELFWGASLFGYCSG 
FQSLLEELVWGAPLFRYGTG 
PLCVLEELFWGASLFGYCSG 

PLCVLEELFWGASLFGQCSG 
HLCVLEELFWGASLFGQCSG 
DLRVLCELFGGAYVLGYCSE 

HRSVLKQLSWGASLFGQWAG 
HLSVGEELSWWVALLGQWAR 
APVSTEELRWGALLFGQWAG 



D8-derived (Small C-C loop): 

Clone Sequence 

D8 KWLDQEWAWVQCEVYGRGCPSKK 

D8 - Gl QLEEEWAGVQCEVYGRECPS 

D8-B5- ALEEE WAWVQVRS I RSGLPL 

D8 - A7 SLDQEWAWVQCEVYGRGCLS 

D8 - Fl - WLEHEWAQIQCELYGRGCTY 



SEQ ID NO: 
1760 
1761 
1762 

1763 
1764 
1765 

1766 
1767 
1768 



SEQ ID NO: 

1769 

1770 

1771 
1772 
1773 



Midi C-C loop: 

Clone 
D8-F10 
F8-B12- 
F8-A9 



Sequence 

GLEQGCPWVGLEVQCRGCPS 
DLRVLCELFGGAYVLGYCSE 
PLWGLCELFGGASLFGYCSS 



1774 
1775 
1776 



**Based on analysis of entire panning data, amino acid preferences at each position were calculated 
to define these "idealized" peptides; * Peptides synthesized and currently being purified; - Peptides 
planned. 

In various aspects of the present invention, amino acid sequences 
comprising Site 1 motifs may bind to Site 1 of IR or Site 1 of IGF-1R. 
Similarly, amino acids sequences comprising Site 2 motifs may bind to Site 
2 of IR or Site 2 of IGF-1R. However, specific peptides may show higher 
binding affinity for IR than for IGF-1R, while other peptides may show higher 
binding affinity for IGF-1 R than for IR. In addition, Site 1 and Site 2 on IR do 
not cross-talk, i.e., Site 1 -binding sequences do not compete with Site 2- 
binding sequences at IR. In contrast, Site 1 and Site 2 on IGF-1 R do show 
some cross-talk, suggesting an allosteric effect. These aspects are 
illustrated in the Examples described hereinbelow. 
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E. Multivalent Ligands 

This invention provides ligands that preferentially bind different sites 
on IR and IGF-1R. The A6 amino acid sequence motif confers binding to IR 
at Site 1 (Figure 6). The D8 amino acid sequence motif confers binding to 
5 IR at Site 2 (Figure 6). Accordingly, multimeric ligands may be prepared 
according to the invention by covalently linking amino acid sequences. 
Depending on the purpose intended for the multivalent ligand, amino acid 
sequences that bind the same or different sites may be combined to form a 
single molecule. Where the multivalent ligand is constructed to bind to the 

10 same corresponding site on different receptors, or different subunits of a 
receptor, the amino acid sequences of the ligand for binding to the receptors 
may be the same or different, provided that if different amino acid 
sequences are used, they both bind to the same site. 

Multivalent ligands may be prepared by either expressing amino acid 

15 sequences which bind to the individual sites separately and then covalently 
linking them together, or by expressing the multivalent ligand as a single 
amino acid sequence which comprises within it the combination of specific 
amino acid sequences for binding. 

Various combinations of amino acid sequences may be combined to 

20 produce multivalent ligands having specific desirable properties. Thus, 
agonists may be combined with agonists, antagonists combined with 
antagonists, and agonists combined with antagonists. Combining amino 
acid sequences that bind to the same site to form a multivalent ligand may 
be useful to produce molecules that are capable of cross-linking together 

25 multiple receptor units. Multivalent ligands may also be constructed to 
combine amino acid sequences which bind to different sites (Figure 7). 

In view of the discovery disclosed herein of monomers having agonist 
properties at IR or IGF-1R, preparation of multivalent ligands may be useful 
to prepare ligands having more desirable pharmacokinetic properties due to 

30 the presence of multiple bind sites on a single molecule. In addition, 
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combining amino acid sequences that bind to different sites with different 
affinities provides a means for modulating the overall potency and affinity of 
the ligand for IR or IGF-1 R. 

1 . Construction of Hybrids 

5 In one embodiment, hybrids of at least two peptides (e.g., dimer 

peptides) may be produced as recombinant fusion polypeptides, which are 
expressed in any suitable expression system. The polypeptides may bind 
the receptor as either fusion constructs containing amino acid sequences 
besides the ligand binding sequences or as cleaved proteins from which 

10 signal sequences or other sequences unrelated to ligand binding are 
removed. Sequences for facilitating purification of the fusion protein may 
also be expressed as part of the construct. Such sequences optionally may 
be subsequently removed to produce the mature binding ligand. 
Recombinant expression also provides means for producing large quantities 

15 of ligand. In addition, recombinant expression may be used to express 
different combinations of amino acid sequences and to vary the orientation 
of their combination, i.e., amino to carboxyl terminal orientation. 

In one embodiment shown below (Figure 28), MBP-FLAG®- 
PEPTIDE-(GGS) n (SEQ ID NO:1777)-PEPTIDE-E-TAG, a fusion construct 

20 producing a peptide dimer comprises a maltose binding protein amino acid 
sequence (MBP) or similar sequence useful for enabling the affinity 
chromatography purification of the expressed peptide sequences. This 
purification facilitating sequence may then be attached to a FLAG® 
sequence to provide a cleavage site to remove the initial sequence. The 

25 dimer then follows which includes the intervening linker and a tag sequence 
may be included at the carboxyl terminal portion to facilitate 
identification/purification of the expression of peptide. In the representative 
construct illustrated above, G and S are glycine and serine residues, which 
make up the linker sequence. As non-limiting examples, n can be 1, 2, 3, or 

30 4 to yield a linker sequence of 3, 6, 9, and 12 amino acids, respectively. 
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ln addition to producing the dimer peptides by recombinant protein 
expression, dimer peptides may also be produced by peptide synthesis 
whereby a synthetic technique such as Merrifield synthesis (Merrifield, 
1 997), may be used to construct the entire peptide. 
5 Other methods of constructing dimer peptides include introducing a 

linker molecule that activates the terminal end of a peptide so that it can 
covalently bind to a second peptide. Examples of such linkers include, but 
are not limited to, diaminoproprionic acid activated with an oxyamino 
function. A preferred linker is a dialdehyde having the formula 0=CH- 

10 (CH 2 ) n -CH=0, wherein n is at least 2 to 6, but is preferably 6 to produce a 
linker of about 25 to 30 angstroms in length. Other preferred linkers are 
shown in Table 3. Linkers may be used, for example, to couple monomers 
at either the carboxyl terminal or the amino terminal ends to form dimer 
peptides. Also, the chemistry can be inverted, i.e., the peptides to be 

15 coupled can be equipped with aldehyde functions, either by oxidation with 
sodium periodate of an N-terminal serine, or by oxidation of any other vicinal 
hydroxy- or amino-groups, and the linker can comprise two oxyamino 
functions (e.g., at end of a polyethylene glycol linker) or amino groups which 
are coupled by reductive amination. 

20 In specific embodiments, Site 1-Site 2 and Site 2-Site 1 orientations 

are possible. In addition, N-terminal to N-terminal (N-N); C-terminal to C- 
terminal (C-C); N-terminal to C-terminal (N-C); and C-terminal to N-terminal 
(C-N) linkages are possible. Accordingly, peptides may be oriented Site 1 to 
Site 2, or Site 2 to Site 1, and may be linked N-terminus to N-terminus, C- 

25 terminus to C-terminus, N-terminus to C-terminus, or C-terminus to N- 
terminus. In certain cases, a specific orientation may be preferable to 
others, for example, for maximal agonist or antagonist activity. 

In an unexpected and surprising result, the orientation and linkage of 
the monomer subunits has been found to dramatically alter dimer activity 

30 (see Examples, below). In particular, certain Site 1/Site 2 heterodimer 
sequences show agonist or antagonist activity at IR, depending on 
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orientation and linkage of the constituent monomer subunits. For example, 
a Site 1-Site 2 orientation (C-N linkage), e.g., the S453 heterodimer, shows 
antagonist activity at IR (Figure 18A; Table 7). In contrast, a Site 2-Site 1 
orientation (C-N linkage), e.g., the S455 heterodimer, shows potent agonist 
5 activity at IR (Figure 18D; Table 7). Similarly, Site 1-Site 2 (C-N linkage) 
heterodimers, e.g., S425 and S459, show antagonist activity at IR (Table 7), 
while Site 1-Site 2 (C-C or N-N linkage) heterodimers, e.g., S432-S438, 
S454, and S456, show agonist activity (Table 7). 

Whether produced by recombinant gene expression or by 

10 conventional chemical linkage technology, the various amino acid 
sequences may be coupled through linkers of various lengths. Where linked 
sequences are expressed recombinantly, and based on an average amino 
acid length of about 4 angstroms, the linkers for connecting the two amino 
acid sequences would typically range from about 3 to about 12 amino acids 

15 corresponding to from about 12 to about 48 A. Accordingly, the preferred 
distance between binding sequences is from about 2 to about 50 A. More 
preferred is 4 to about 40. The degree of flexibility of the linker between the 
amino acid sequences may be modulated by the choice of amino acids used 
to construct the linker. The combination of glycine and serine is useful for 

20 producing a flexible, relatively unrestrictive linker. A more rigid linker may 
be constructed by using amino acids with more complex side chains within 
the linkage sequence. 

2. Characterization Of Specific Dimers 

Specific dimers which are comprised of monomer subunits that both 
25 bind with high affinity to the same site on IR or IGF-1R (e.g., Site 1-Site 1 or 
Site 2-Site 2), or monomer subunits that bind to different sites on IR or IGF- 
1R (e.g., Site 1-Site 2 or Site 2-Site 1) are disclosed herein. 

Other combinations of peptides are within the scope of this invention 
and may be determined as demonstrated in the examples described herein. 
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F. Peptide Synthesis 

Many conventional techniques in molecular biology, protein 
biochemistry, and immunology may be used to produce the amino acid 
sequences for use with this invention. The present invention encompasses 
5 the specific amino acid sequences shown in Figures 1-4, 8, and 9 and Table 
7, inter alia, without additions (e.g., linker or spacer sequences) deletions, 
alterations, or modification. The present invention further encompasses 
variants that include additional sequences, altered sequences, and 
functional fragments thereof. In a preferred embodiment, the amino acid 

10 sequence variant or fragment shares at least one function characteristic 
(e.g., binding, agonist, or antagonist activity) of the reference sequence. 
Variant peptides include, for example, genetically engineered mutants, and 
may differ from the amino acid sequences shown in the figures and tables of 
the application by the addition, deletion, or substitution of one or more 

15 amino acid residues. Alterations may occur at the amino- or carboxy- 
terminal positions of the reference amino acid sequence or anywhere 
between those terminal positions, interspersed either individually among the 
amino acids in the reference sequence or in one or more contiguous groups 
within the reference sequence. In addition, variants may comprise synthetic 

20 or non-naturally occurring amino acids in accordance with this invention. 

Variant amino acid sequences can have conservative changes, 
wherein a substituted amino acid has similar structural or chemical 
properties, e.g., replacement of leucine with isoleucine. More infrequently, a 
variant peptide can have non-conservative changes, e.g., substitution of a 

25 glycine with a tryptophan. Guidance in determining which amino acid 
residues can be substituted, inserted, or deleted without abolishing binding 
or biological activity can be found using computer programs well-known in 
the art, for example, DNASTAR software (DNASTAR, Inc., Madison, Wl). 
Guidance is also provided by the data disclosed herein. In particular, 

30 Figures 1-4, 8, 9, 43, 44, and Table 7, inter alia, teach which amino acid 



WO 03/027246 



PCT/US02/30412 



-55- 

residues can be deleted, added, substituted, or modified, while maintaining 
the IR- or IGF-1 R-related function(s) (e.g., binding, agonist, or antagonist 
activity) of the amino acid sequences. 

For the purposes of this invention, the amino acids are grouped as 
5 follows: amino acids possessing alcohol groups are serine (S) and 
threonine (T). Aliphatic amino acids are isoleucine (I), leucine (L), valine 
(V), and methionine (M). Aromatic amino acids are phenylalanine (F), 
histidine (H), tryptophan (W), and tyrosine (Y). Hydrophobic amino acids 
are alanine (A), cysteine (C), phenylalanine (F), glycine (G), histidine (H), 

10 isoleucine (I), leucine (L), methionine (M), arginine (R), threonine (T), valine 
(V), tryptophan (W), and tyrosine (Y). Negative amino acids are aspartic 
acid (D) and glutamic acid (E). The following amino acids are polar amino 
acids: cysteine (C), aspartic acid (D), glutamic acid (E), histidine (H), lysine 
(K), asparagine (N), glutamine (Q), arginine (R), serine (S), and threonine 

15 (T). Positive amino acids are histidine (H), lysine (K), and arginine (R). 
Small amino acids are alanine (A), cysteine (C), aspartic acid (D), glycine 
(G), asparagine (N), proline (P), serine (S), threonine (T), and valine (V). 
Very small amino acids are alanine (A), glycine (G) and serine (S). Amino 
acids likely to be involved in a turn formation are alanine (A), cysteine (C), 

20 aspartic acid (D), glutamic acid (E), glycine (G), histidine (H), lysine (K), 
asparagine (N), glutamine (Q), arginine (R), serine (S), proline (P), and 
threonine (T). As non-limiting examples, the amino acids within each of 
these defined groups may be substituted for each other in the formulas 
described above, as conservative substitutions, subject to the specific 

25 preferences stated herein. 

Substantial changes in function can be made by selecting 
substitutions that are less conservative than those shown in the defined 
groups, above. For example, non-conservative substitutions can be made 
which more significantly affect the structure of the peptide in the area of the 

30 alteration, for example, the alpha-helical, or beta-sheet structure; the charge 
or hydrophobicity of the molecule at the target site; or the bulk of the side 
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chain. The substitutions which generally are expected to produce the 
greatest changes in the peptide's properties are those where 1) a 
hydrophilic residue, e.g., seryl or threonyl, is substituted for (or by) a 
hydrophobic residue, e.g., leucyl, isoleucyl, phenylalanyl, valyl, or alanyl; 2) 
5 a cysteine or proline is substituted for (or by) any other residue; 3) a residue 
having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is 
substituted for (or by) an electronegative residue, e.g., glutamyl or aspartyl; 
or 4) a residue having a bulky side chain, e.g., phenylalanine, is substituted 
for (or by) a residue that does not have a side chain, e.g., glycine. 

10 Amino acid preferences have been identified for certain peptides and 

peptide groups of the present invention. For example, amino acid 
preferences for the RP9, D8, and Group 6 (Formula 10) peptides are shown 
in Tables 17-19, below. In some instances, cysteine pairs may also be 
preferred. For example, cysteine pairs are preferred in certain Formula 1 

15 and Formula 2 sequences described herein. In accordance with the 
invention, the amino acid sequences of the invention may include two or 
more cysteine residues, which may be separated by at least 3, 4, 5, 6, 7, 8, 
9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or 19 amino acids, and may be 
positioned inside or outside the Formula 1 or Formula 2 motif sequence. 

20 Preferably, the cysteines are separated by 17 or 18 amino acids. 

Variants also include amino acid sequences in which one or more 
residues are modified (i.e., by phosphorylation, sulfation, acylation, 
PEGylation, etc.), and mutants comprising one or more modified residues. 
Amino acid sequences may also be modified with a label capable of 

25 providing a detectable signal, either directly or indirectly, including, but not 
limited to, radioisotope, fluorescent, and enzyme labels. Fluorescent labels 
include, for example, Cy3, Cy5, Alexa, BODIPY, fluorescein (e.g., FluorX, 
DTAF, and FITC), rhodamine (e.g., TRITC), auramine, Texas Red, AMCA 
blue, and Lucifer Yellow. Preferred isotope labels include 3 H, 14 C, 32 P, 35 

30 S, 36 CI, 51 Cr, 57 Co, 58 Co, 59 Fe, 90 Y, 125 I, 131 I, and 186 Re. Preferred 
enzyme labels include peroxidase, p-glucuronidase, (3-D-glucosidase, p-D- 
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galactosidase, urease, glucose oxidase plus peroxidase, and alkaline 
phosphatase (see, e.g., U.S. Pat. Nos. 3,654,090; 3,850,752 and 
4,016,043). Enzymes can be conjugated by reaction with bridging 
molecules such as carbodiimides, diisocyanates, glutaraldehyde, and the 
5 like. Enzyme labels can be detected visually, or measured by calorimetric, 
spectrophotometric, fluorospectrophotometric, amperometric, or gasometric 
techniques. Other labeling systems, such as avidin/biotin, Tyramide Signal 
Amplification (TSA™), are known in the art, and are commercially available 
(see, e.g., ABC kit, Vector Laboratories, Inc., Burlingame, CA; NEN® Life 
10 Science Products, Inc., Boston, MA). 

1 . Recombinant Synthesis of Peptides 

To obtain recombinant peptides, DNA sequences encoding these 
peptides may be cloned into any suitable vectors for expression in intact 
host cells or in cell-free translation systems by methods well-known in the 

15 art (see Sambrook et al. y 1989). The particular choice of the vector, host, or 
translation system is not critical to the practice of the invention. 

A large number of vectors, including bacterial, yeast, and mammalian 
vectors, have been described for replication and/or expression in various 
host cells or cell-free systems, and may be used for gene therapy as well as 

20 for simple cloning or protein expression. In one aspect of the present 
invention, an expression vector comprises a nucleic acid encoding a IR or 
IGF-1R agonist or antagonist peptide, as described herein, operably linked 
to at least one regulatory sequence. Regulatory sequences are known in 
the art and are selected to direct expression of the desired protein in an 

25 appropriate host cell. Accordingly, the term regulatory sequence includes 
promoters, enhancers and other expression control elements (see D.V. 
Goeddel (1990) Methods Enzymol. 185:3-7). Enhancer and other 
expression control sequences are described in Enhancers and Eukaryotic 
Gene Expression, Cold Spring Harbor Press, Cold Spring Harbor, NY 

30 (1983). It should be understood that the design of the expression vector 
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may depend on such factors as the choice of the host cell to be transfected 
and/or the type of peptide desired to be expressed. 

Several regulatory elements (e.g., promoters) have been isolated and 
shown to be effective in the transcription and translation of heterologous 
5 proteins in the various hosts. Such regulatory regions, methods of isolation, 
manner of manipulation, etc. are known in the art. Non-limiting examples of 
bacterial promoters include the p-lactamase (penicillinase) promoter; lactose 
promoter; tryptophan (trp) promoter; araBAD (arabinose) operon promoter; 
lambda-derived promoter and N gene ribosome binding site; and the 

10 hybrid tac promoter derived from sequences of the trp and lac UV5 
promoters. Non-limiting examples of yeast promoters include the 3- 
phosphoglycerate kinase promoter, glyceraldehyde-3-phosphate 
dehydrogenase (GAPDH) promoter, galactokinase (GAL1) promoter, 
galactoepimerase promoter, and alcohol dehydrogenase (ADH1) promoter. 

15 Suitable promoters for mammalian cells include, without limitation, viral 
promoters, such as those from Simian Virus 40 (SV40), Rous sarcoma virus 
(RSV), adenovirus (ADV), and bovine papilloma virus (BPV). Preferred 
replication and inheritance systems include M13, ColE1, SV40, baculovirus, 
lambda, adenovirus, CEN ARS, 2^im ARS and the like. While expression 

20 vectors may replicate autonomously, they may also replicate by being 
inserted into the genome of the host cell, by methods well-known in the art. 

To obtain expression in eukaryotic cells, terminator sequences, 
polyadenylation sequences, and enhancer sequences that modulate gene 
expression may be required. Sequences that cause amplification of the 

25 gene may also be desirable. Furthermore, sequences that facilitate 
secretion of the recombinant product from cells, including, but not limited to, 
bacteria, yeast, and animal cells, such as secretory signal sequences and/or 
preprotein or proprotein sequences, may also be included. These 
sequences are well-described in the art. DNA sequences can be optimized, 

30 if desired, for more efficient expression in a given host organism or 
expression system. For example, codons can be altered to conform to the 
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preferred codon usage in a given host cell or cell-free translation system 
using well-established techniques. 

Codon usage data can be obtained from publicly-available sources, 
for example, the Codon Usage Database at http://www.kazusa.or.jp/codon/. 
5 In addition, computer programs that translate amino acid sequence 
information into nucleotide sequence information in accordance with codon 
preferences (i.e., backtranslation programs) are widely available. See, for 
example, Backtranslate program from Genetics Computer Group (GCG), 
Accelrys, Inc., Madison, Wl; and Backtranslation Applet from Entelechon 
10 GmbH, Regensburg, Germany. Thus, using the peptide sequences 
disclosed herein, one of ordinary skill in the art can design nucleic acids to 
yield optimal expression levels in the translation system or host cell of 
choice. 

Expression and cloning vectors will likely contain a selectable marker, 
15 a gene encoding a protein necessary for survival or growth of a host cell 
transformed with the vector. The presence of this gene ensures growth of 
only those host cells that express the inserts. Typical selection genes 
encode proteins that 1) confer resistance to antibiotics or other toxic 
substances, e.g., ampicillin, neomycin, methotrexate, etc.; 2) complement 
20 auxotrophic deficiencies, or 3) supply critical nutrients not available from 
complex media, e.g., the gene encoding D-alanine racemase for Bacilli. 
Markers may be an inducible or non-inducible gene and will generally allow 
for positive selection. Non-limiting examples of markers include the 
ampicillin resistance marker (i.e., beta-lactamase), tetracycline resistance 
25 marker, neomycin/kanamycin resistance marker (i.e., neomycin 
phosphotransferase), dihydrofolate reductase, glutamine synthetase, and 
the like. The choice of the proper selectable marker will depend on the host 
cell, and appropriate markers for different hosts as understood by those of 
skill in the art. 

30 Suitable expression vectors for use with the present invention 

include, but are not limited to, pUC, pBluescript (Stratagene), pET 
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(Novagen, Inc., Madison, Wl), and pREP (Invitrogen) plasmids. Vectors can 
contain one or more replication and inheritance systems for cloning or 
expression, one or more markers for selection in the host, e.g., antibiotic 
resistance, and one or more expression cassettes. The inserted coding 
5 sequences can be synthesized by standard methods, isolated from natural 
sources, or prepared as hybrids. Ligation of the coding sequences to 
transcriptional regulatory elements (e.g., promoters, enhancers, and/or 
insulators) and/or to other amino acid encoding sequences can be carried 
out using established methods. 

10 Suitable cell-free expression systems for use with the present 

invention include, without limitation, rabbit reticulocyte lysate, wheat germ 
extract, canine pancreatic microsomal membranes, E. coli S30 extract, and 
coupled transcription/translation systems (Promega Corp., Madison, Wl). 
These systems allow the expression of recombinant peptides upon the 

15 addition of cloning vectors, DNA fragments, or RNA sequences containing 
protein-coding regions and appropriate promoter elements. 

Non-limiting examples of suitable host cells include bacteria, archea, 
insect, fungi (e.g., yeast), plant, and animal cells (e.g., mammalian, 
especially human). Of particular interest are Escherichia coli, Bacillus 

20 subtilis, Saccharomyces cerevisiae, SF9 cells, C129 cells, 293 cells, 
Neurospora, and immortalized mammalian myeloid and lymphoid cell lines. 
Techniques for the propagation of mammalian cells in culture are well- 
known (see, Jakoby and Pastan (Eds), 1979, Cell Culture. Methods in 
Enzymology, volume 58, Academic Press, Inc., Harcourt Brace Jovanovich, 

25 NY). Examples of commonly used mammalian host cell lines are VERO and 
HeLa cells, CHO cells, and WI38, BHK, and COS cell lines, although it will 
be appreciated by the skilled practitioner that other cell lines may be used, 
e.g., to provide higher expression, or other features. 

Host cells can be transformed, transfected, or infected as appropriate 

30 by any suitable method including electroporation, calcium chloride-, lithium 
chloride-, lithium acetate/polyethylene glycol-, calcium phosphate-, DEAE- 
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dextran-, liposome-mediated DNA uptake, spheroplasting, injection, 
microinjection, microprojectile bombardment, phage infection, viral infection, 
or other established methods. Alternatively, vectors containing the nucleic 
acids of interest can be transcribed in vitro, and the resulting RNA 
5 introduced into the host cell by well-known methods, e.g., by injection (see, 
Kubo et al., 1988, FEBS Letts. 241:119). The cells into which have been 
introduced nucleic acids described above are meant to also include the 
progeny of such cells. 

Nucleic acids encoding the peptides of the invention may be isolated 

10 directly from recombinant phage libraries (e.g., RAPIDLIB® or GRABLIB® 
libraries) described herein. Alternatively, the polymerase chain reaction 
(PGR) method can be used to produce nucleic acids of the invention, using 
the recombinant phage libraries as templates. Primers used for PCR can be 
synthesized using the sequence information provided herein and can further 

15 be designed to introduce appropriate new restriction sites, if desirable, to 
facilitate incorporation into a given vector for recombinant expression. 

Nucleic acids encoding the peptides of the present invention can also 
be produced by chemical synthesis, e.g., by the phosphoramidite method 
described by Beaucage et a/., 1981, Tetra. Letts. 22:1859-1862, or the 

20 triester method according to Matteucci ef a/., 1981, J. Am. Chem. Soc, 
103:3185, and can performed on commercial, automated oligonucleotide 
synthesizers. A double-stranded fragment may be obtained from the single- 
stranded product of chemical synthesis either by synthesizing the 
complementary strand and annealing the strands together under appropriate 

25 conditions or by adding the complementary strand using DNA polymerase 
with an appropriate primer sequence. 

The nucleic acids encoding the peptides of the invention can be 
produced in large quantities by replication in a suitable host cell. Natural or 
synthetic nucleic acid fragments, comprising at least ten contiguous bases 

30 coding for a desired amino acid sequence can be incorporated into 
recombinant nucleic acid constructs, usually DNA constructs, capable of 
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introduction into and replication in a prokaryotic or eukaryotic cell. Usually 
the nucleic acid constructs will be suitable for replication in a unicellular 
host, such as yeast or bacteria, but may also be intended for introduction to 
(with and without integration within the genome) cultured mammalian or 
5 plant or other eukaryotic cells, cell lines, tissues, or organisms. The 
purification of nucleic acids produced by the methods of the present 
invention is described, for example, in Sambrook et a/., 1989; F.M. Ausubel 
et a/., 1992, Current Protocols in Molecular Biology, J. Wiley and Sons, New 
York, NY. 

10 These nucleic acids can encode variant or truncated forms of the 

peptides as well as the reference peptides shown in Figures 1-4, 8, and 9 
and Table 7, inter alia. Large quantities of the nucleic acids and peptides of 
the present invention may be prepared by expressing the nucleic acids or 
portions thereof in vectors or other expression vehicles in compatible 

15 prokaryotic or eukaryotic host cells. The most commonly used prokaryotic 
hosts are strains of Escherichia coli, although other prokaryotes, such as 
Bacillus subtilis or Pseudomonas may also be used. Mammalian or other 
eukaryotic host cells, such as those of yeast, filamentous fungi, plant, insect, 
or amphibian or avian species, may also be useful for production of the 

20 proteins of the present invention. For example, insect cell systems (i.e., 
lepidopteran host cells and baculovirus expression vectors) are particularly 
suited for large-scale protein production. 

Host cells carrying an expression vector (i.e., transformants or 
clones) are selected using markers depending on the mode of the vector 

25 construction. The marker may be on the same or a different DNA molecule, 
preferably the same DNA molecule. In prokaryotic hosts, the transformant 
may be selected, e.g., by resistance to ampicillin, tetracycline or other 
antibiotics. Production of a particular product based on temperature 
sensitivity may also serve as an appropriate marker. 

30 For some purposes, it is preferable to produce the peptide in a 

recombinant system in which the peptide contains an additional sequence 
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(e.g., epitope or protein) tag that facilitates purification. Non-limiting 
examples of epitope tags include c-myc, haemagglutinin (HA), polyhistidine 
(6X-HIS)(SEQ ID NO:1778), GLU-GLU, and DYKDDDDK (SEQ ID 
NO:1779) or DYKD (SEQ ID NO:1545; FLAG®) epitope tags. Non-limiting 
5 examples of protein tags include glutathione-S-transferase (GST), green 
fluorescent protein (GFP), and maltose binding protein (MBP). In one 
approach, the coding sequence of a peptide can be cloned into a vector that 
creates a fusion with a sequence tag of interest. Suitable vectors include, 
without limitation, pRSET (Invitrogen Corp., San Diego, CA), pGEX 

10 (Amersham Pharmacia Biotech, Inc., Piscataway, NJ), pEGFP (CLONTECH 
Laboratories, Inc., Palo Alto, CA), and pMAL™ (New England BioLabs, Inc., 
Beverly, MA) plasmids. Following expression, the epitope or protein tagged 
peptide can be purified from a crude lysate of the translation system or host 
cell by chromatography on an appropriate solid-phase matrix. In some 

15 cases, it may be preferable to remove the epitope or protein tag (i.e., via 
protease cleavage) following purification. 

Methods for directly purifying peptides from sources such as cellular 
or extracellular lysates are well-known in the art (see Harris and Angal, 
1989). Such methods include, without limitation, sodium dodecylsulfate- 

20 polyacrylamide gel electrophoresis (SDS-PAGE), preparative disc-gel 
electrophoresis, isoelectric focusing, high-performance liquid 
chromatography (HPLC), reversed-phase HPLC, gel filtration, ion exchange 
and partition chromatography, countercurrent distribution, and combinations 
thereof. Peptides can be purified from many possible sources, for example, 

25 plasma, body tissues, or body fluid lysates derived from human or animal, 
including mammalian, bird, fish, and insect sources. 

Antibody-based methods may also be used to purify peptides. 
Antibodies that recognize these peptides or fragments derived therefrom 
can be produced and isolated. The peptide can then be purified from a 

30 crude lysate by chromatography on an antibody-conjugated solid-phase 
matrix (see Harlow and Lane, 1998). 
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2. Chemical Synthesis Of Peptides 

Alternately, peptides may be chemically synthesized by commercially 
available automated procedures, including, without limitation, exclusive solid 
phase synthesis, partial solid phase methods, fragment condensation or 
5 classical solution synthesis. The peptides are preferably prepared by solid- 
phase peptide synthesis; for example, as described by Merrifield (1965; 
1997). 

According to methods known in the art, peptides can be chemically 
synthesized by commercially available automated procedures, including, 

10 without limitation, exclusive solid phase synthesis, partial solid phase 
methods, fragment condensation, classical solution synthesis. In addition, 
recombinant and synthetic methods of peptide production can be combined 
to produce semi-synthetic peptides. The peptides of the invention are 
preferably prepared by solid phase peptide synthesis as described by 

15 Merrifield, 1963, J. Am. Chem. Soc. 85:2149; 1997. In one embodiment, 
synthesis is carried out with amino acids that are protected at the alpha- 
amino terminus. Trifunctional amino acids with labile side-chains are also 
protected with suitable groups to prevent undesired chemical reactions from 
occurring during the assembly of the peptides. The alpha-amino protecting 

20 group is selectively removed to allow subsequent reaction to take place at 
the amino-terminus. The conditions for the removal of the alpha-amino 
protecting group do not remove the side-chain protecting groups. 

The alpha-amino protecting groups are those known to be useful in 
the art of stepwise peptide synthesis. Included are acyl type protecting 

25 groups, e.g., formyl, trifluoroacetyl, acetyl, aromatic urethane type protecting 
groups, e.g., benzyloxycarbonyl (Cbz), substituted benzyloxycarbonyl and 9- 
fluorenylmethyloxycarbonyl (Fmoc), aliphatic urethane protecting groups, 
e.g., t-butyloxycarbonyl (Boc), isopropyloxycarbonyl, cyclohexyloxycarbonyl, 
and alkyl type protecting groups, e.g., benzyl, triphenylmethyl. The 

30 preferred protecting group is Boc. The side-chain protecting groups for Tyr 
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include tetrahydropyranyl, tert-butyl, trityl, benzyl, Cbz, 4-Br-Cbz and 2,6- 
dichlorobenzyl. The preferred side-chain protecting group for Tyr is 2,6- 
dichlorobenzyl. The side-chain protecting groups for Asp include benzyl, 
2,6-dichlorobenzyl, methyl, ethyl, and cyclohexyl. The preferred side-chain 
5 protecting group for Asp is cyclohexyl. The side-chain protecting groups for 
Thr and Ser include acetyl, benzoyl, trityl, tetrahydropyranyl, benzyl, 2,6- 
dichlorobenzyl, and Cbz. The preferred protecting group for Thr and Ser is 
benzyl. The side-chain protecting groups for Arg include nitro, Tos, Cbz, 
adamantyloxycarbonyl, and Boc. The preferred protecting group for Arg is 
10 Tos. The side-chain amino group of Lys can be protected with Cbz, 2-CI- 
Cbz, Tos, or Boc. The 2-CI-Cbz group is the preferred protecting group for 
Lys. 

The side-chain protecting groups selected must remain intact during 
coupling and not be removed during the deprotection of the amino-terminus 

15 protecting group or during coupling conditions. The side-chain protecting 
groups must also be removable upon the completion of synthesis, using 
reaction conditions that will not alter the finished peptide. 

Solid phase synthesis is usually carried out from the carboxy- 
terminus by coupling the alpha-amino protected (side-chain protected) 

20 amino acid to a suitable solid support. An ester linkage is formed when the 
attachment is made to a chloromethyl or hydroxymethyl resin, and the 
resulting peptide will have a free carboxyl group at the C-terminus. 
Alternatively, when a benzhydrylamine or p-methylbenzhydrylamine resin is 
used, an amide bond is formed and the resulting peptide will have a 

25 carboxamide group at the C-terminus. These resins are commercially 
available, and their preparation has described by Stewart et al., 1984, Solid 
Phase Peptide Synthesis (2nd Edition), Pierce Chemical Co., Rockford, IL. 

The C-terminal amino acid, protected at the side chain if necessary 
and at the alpha-amino group, is coupled to the benzhydrylamine resin using 

30 various activating agents including dicyclohexylcarbodiimide (DCC), N,N'- 
diisopropyl-carbodiimide and carbonyldiimidazole. Following the attachment 
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to the resin support, the alpha-amino protecting group is removed using 
trifluoroacetic acid (TFA) or HCI in dioxane at a temperature between 0 and 
25°C. Dimethylsulfide is added to the TFA after the introduction of 
methionine (Met) to suppress possible S-alkylation. After removal of the 
5 alpha-amino protecting group, the remaining protected amino acids are 
coupled stepwise in the required order to obtain the desired sequence. 

Various activating agents can be used for the coupling reactions 
including DCC,N,N'-diisopropyl-carbodiimide, benzotriazol-1-yl-oxy-tris- 
(dimethylamino) phosphonium hexa-fluorophosphate (BOP) and DCC- 

10 hydroxy benzotriazole (HOBt). Each protected amino acid is used in excess 
(>2.0 equivalents), and the couplings are usually carried out in N- 
methylpyrrolidone (NMP) or in DMF, CH 2 CI 2 or mixtures thereof. The extent 
of completion of the coupling reaction is monitored at each stage, e.g., by 
the ninhydrin reaction as described by Kaiser et a/., 1970, Anal. Biochem. 

15 34:595. In cases where incomplete coupling is found, the coupling reaction 
is repeated. The coupling reactions can be performed automatically with 
commercially available instruments. 

After the entire assembly of the desired peptide, the peptide-resin is 
cleaved with a reagent such as liquid HF for 1-2 h at 0°C, which cleaves the 

20 peptide from the resin and removes all side-chain protecting groups. A 
scavenger such as anisole is usually used with the liquid HF to prevent 
cations formed during the cleavage from alkylating the amino acid residues 
present in the peptide. The peptide-resin can be deprotected with 
TFA/dithioethane prior to cleavage if desired. 

25 Side-chain to side-chain cyclization on the solid support requires the 

use of an orthogonal protection scheme which enables selective cleavage of 
the side-chain functions of acidic amino acids (e.g., Asp) and the basic 
amino acids (e.g., Lys). The 9-fluorenylmethyl (Fm) protecting group for the 
side-chain of Asp and the 9-fluorenylmethyloxycarbonyl (Fmoc) protecting 

30 group for the side-chain of Lys can be used for this purpose. In these 
cases, the side-chain protecting groups of the Boc-protected peptide-resin 
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are selectively removed with piperidine in DMF. Cyclization is achieved on 
the solid support using various activating agents including DCC, DCC/HOBt, 
or BOP. The HF reaction is carried out on the cyclized peptide-resin as 
described above. 

5 3. Peptide Libraries 

Peptide libraries produced and screened according to the present 
invention are useful in providing new ligands for IR and IGF-1R. Peptide 
libraries can be designed and panned according to methods described in 
detail herein, and methods generally available to those in the art (see, e.g., 

10 U.S. Patent No. 5,723,286 issued March 3, 1998 to Dower et a/.). In one 
aspect, commercially available phage display libraries can be used (e.g., 
RAPIDLIB® or GRABLIB®, DGI BioTechnologies, Inc., Edison, NJ; Ph.D. 
C7C Disulfide Constrained Peptide Library, New England Biolabs). In 
another aspect, an oligonucleotide library can be prepared according to 

15 methods known in the art, and inserted into an appropriate vector for peptide 
expression. For example, vectors encoding a bacteriophage structural 
protein, preferably an accessible phage protein, such as a bacteriophage 
coat protein, can be used. Although one skilled in the art will appreciate that 
a variety of bacteriophage may be employed in the present invention, in 

20 preferred embodiments the vector is, or is derived from, a filamentous 
bacteriophage, such as, for example, f1, fd, Pf1, M13, etc. In particular, the 
fd-tet vector has been extensively described in the literature (see, e.g., 
Zacher et a/., 1980, Gene 9:127-140; Smith et a/., 1985, Science 228:1315- 
1317; Parmley and Smith, 1988, Gene 73:305-318). 

25 The phage vector is chosen to contain or is constructed to contain a 

cloning site located in the 5' region of the gene encoding the bacteriophage 
structural protein, so that the peptide is accessible to receptors in an affinity 
enrichment procedure as described hereinbelow. The structural phage 
protein is preferably a coat protein. An example of an appropriate coat 

30 protein is pill. A suitable vector may allow oriented cloning of the 
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oligonucleotide sequences that encode the peptide so that the peptide is 
expressed at or within a distance of about 100 amino acid residues of the N- 
terminus of the mature coat protein. The coat protein is typically expressed 
as a preprotein, having a leader sequence. 
5 Thus, desirably the oligonucleotide library is inserted so that the N- 

terminus of the processed bacteriophage outer protein is the first residue of 
the peptide, i.e., between the 3-terminus of the sequence encoding the 
leader protein and the S'-terminus of the sequence encoding the mature 
protein or a portion of the 5' terminus. The library is constructed by cloning 

10 an oligonucleotide which contains the variable region of library members 
(and any spacers, as discussed below) into the selected cloning site. Using 
known recombinant DNA techniques (see generally, Sambrook et a/., 1989, 
Molecular Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989), an oligonucleotide may 

15 be constructed which, inter alia; 1) removes unwanted restriction sites and 
adds desired ones; 2) reconstructs the correct portions of any sequences 
which have been removed (such as a correct signal peptidase site, for 
example); 3) inserts the spacer residues, if any; and/or 4) corrects the 
translation frame (if necessary) to produce active, infective phage. 

20 The central portion of the oligonucleotide will generally contain one or 

more IR and/or IGF-1R binding sequences and, optionally, spacer 
sequences. The sequences are ultimately expressed as peptides (with or 
without spacers) fused to or in the N-terminus of the mature coat protein on 
the outer, accessible surface of the assembled bacteriophage particles. The 

25 size of the library will vary according to the number of variable codons, and 
hence the size of the peptides, which are desired. Generally the library will 
be at least about 10 6 members, usually at least 10 7 , and typically 10 8 or 
more members. To generate the collection of oligonucleotides which forms 
a series of codons encoding a random collection of amino acids and which 

30 is ultimately cloned into the vector, a codon motif is used, such as (NNK) X , 
where N may be A, C, G, or T (nominally equimolar), K is G or T (nominally 
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equimolar), and x is typically up to about 5, 6, 7, 8, or more, thereby 
producing libraries of penta-, hexa-, hepta-, and octa-peptides or larger. 
The third position may also be G or C, designated "S". Thus, NNK or NNS 
1) code for all the amino acids; 2) code for only one stop codon; and 3) 
5 reduce the range of codon bias from 6:1 to 3:1 . 

It should be understood that, with longer peptides, the size of the 
library that is generated may become a constraint in the cloning process. 
The expression of peptides from randomly generated mixtures of 
oligonucleotides in appropriate recombinant vectors is known in the art (see, 

10 e.g., Oliphant et a/., Gene 44:177-183). For example, the codon motif 
(NNK) 6 produces 32 codons, one for each of 12 amino acids, two for each of 
five amino acids, three for each of three amino acids and one (amber) stop 
codon. Although this motif produces a codon distribution as equitable as 
available with standard methods of oligonucleotide synthesis, it results in a 

15 bias against peptides containing one-codon residues. In particular, a 
complete collection of hexacodons contains one sequence encoding each 
peptide made up of only one-codon amino acids, but contains 729 (3 6 ) 
sequences encoding each peptide with only three-codon amino acids. 

An alternative approach to minimize the bias against one-codon 

20 residues involves the synthesis of 20 activated trinucleotides, each 
representing the codon for one of the 20 genetically encoded amino acids. 
These are synthesized by conventional means, removed from the support 
while maintaining the base and 5-OH-protecting groups, and activated by 
the addition of 3'O-phosphoramidite (and phosphate protection with b- 

25 cyanoethyl groups) by the method used for the activation of 
mononucleosides (see, generally, McBride and Caruthers, 1983, 
Tetrahedron Letters 22:245). Degenerate oligocodons are prepared using 
these trimers as building blocks. The trimers are mixed at the desired molar 
ratios and installed in the synthesizer. The ratios will usually be 

30 approximately equimolar, but may be a controlled unequal ratio to obtain the 
over- to under-representation of certain amino acids coded for by the 
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degenerate oligonucleotide collection. The condensation of the trimers to 
form the oligocodons is done essentially as described for conventional 
synthesis employing activated mononucleosides as building blocks (see, 
e.g., Atkinson and Smith, 1984, Oligonucleotide Synthesis, M.J. Gait, Ed., p. 
5 35-82). This procedure generates a population of oligonucleotides for 
cloning that is capable of encoding an equal distribution (or a controlled 
unequal distribution) of the possible peptide sequences. Advantageously, 
this approach may be employed in generating longer peptide sequences, 
since the range of bias produced by the (NNK) 6 motif increases by three-fold 

10 with each additional amino acid residue. 

When the codon motif is (NNK) X , as defined above, and when x 
equals 8, there are 2.6. x 10 10 possible octa-peptides. A library containing 
most of the octa-peptides may be difficult to produce. Thus, a sampling of 
the octa-peptides may be accomplished by constructing a subset library 

15 using up to about 10% of the possible sequences, which subset of 
recombinant bacteriophage particles is then screened. If desired, to extend 
the diversity of a subset library, the recovered phage subset may be 
subjected to mutagenesis and then subjected to subsequent rounds of 
screening. This mutagenesis step may be accomplished in two general 

20 ways: the variable region of the recovered phage may be mutagenized, or 
additional variable amino acids may be added to the regions adjoining the 
initial variable sequences. 

To diversify around active peptides (i.e., binders) found in early 
rounds of panning, the positive phage can sequenced to determine the 

25 identity of the active peptides. Oligonucleotides can then be synthesized 
based on these peptide sequences. The syntheses are done with a low 
level of all bases incorporated at each step to produce slight variations of 
the primary oligonucleotide sequences. This mixture of (slightly) degenerate 
oligonucleotides can then be cloned into the affinity phage by methods 

30 known to those in the art. This method produces systematic, controlled 
variations of the starting peptide sequences as part of a secondary library. It 
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requires, however, that individual positive phage be sequenced before 
mutagenesis, and thus is useful for expanding the diversity of small numbers 
of recovered phage. 

An alternate approach to diversify the selected phage allows the 
5 mutagenesis of a pool, or subset, of recovered phage. In accordance with 
this approach, phage recovered from panning are pooled and single 
stranded DNA is isolated. The DNA is mutagenized by treatment with, e.g., 
nitrous acid, formic acid, or hydrazine. These treatments produce a variety 
of damage to the DNA. The damaged DNA is then copied with reverse 

10 transcriptase, which reincorporates bases when it encounters a site of 
damage. The segment containing the sequence encoding the receptor- 
binding peptide is then isolated by cutting with restriction nuclease(s) 
specific for sites flanking the peptide coding sequence. This mutagenized 
segment is then recloned into undamaged vector DNA, the DNA is 

15 transformed into cells, and a secondary library according to known methods. 
General mutagenesis methods are known in the art (see Myers et a/., 1985, 
Nucl. Acids Res. 13:3131-3145; Myers et a/., 1985, Science 229:242-246; 
Myers, 1989, Current Protocols in Molecular Biology Vol. /, 8.3.1-8.3.6, F. 
Ausubel et a/., eds, J. Wiley and Sons, New York). 

20 In another general approach, the addition of amino acids to a peptide 

or peptides found to be active, can be carried out using various methods. In 
one, the sequences of peptides selected in early panning are determined 
individually and new oligonucleotides, incorporating the determined 
sequence and an adjoining degenerate sequence, are synthesized. These 

25 are then cloned to produce a secondary library. Alternatively, methods can 
be used to add a second IR or IGF-1R binding sequence to a pool of 
peptide-bearing phage. In accordance with one method, a restriction site is 
installed next to the first IR or IGF-1R binding sequence. Preferably, the 
enzyme should cut outside of its recognition sequence. The recognition site 

30 may be placed several bases from the first binding sequence. To insert a 
second IR or IGF-1R binding sequence, the pool of phage DNA is digested 
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and blunt-ended by filling in the overhang with Klenow fragment. Double- 
stranded, blunt-ended, degenerately synthesized oligonucleotides are then 
ligated into this site to produce a second binding sequence juxtaposed to the 
first binding sequence. This secondary library is then amplified and 
5 screened as before. 

While in some instances it may be appropriate to synthesize longer 
peptides to bind certain receptors, in other cases it may be desirable to 
provide peptides having two or more IR or IGF-1R binding sequences 
separated by spacer (e.g., linker) residues. For example, the binding 

10 sequences may be separated by spacers that allow the regions of the 
peptides to be presented to the receptor in different ways. The distance 
between binding regions may be as little as 1 residue, or at least 2-20 
residues, or up to at least 100 residues. Preferred spacers are 3, 6, 9, 12, 
15, or 18 residues in length. For probing large binding sites or tandem 

15 binding sites (e.g., Site 1 and Site 2 of IR), the binding regions may be 
separated by a spacer of residues of up to 20 to 30 amino acids. The 
number of spacer residues when present will typically be at least 2 residues, 
and often will be less than 20 residues. 

The oligonucleotide library may have binding sequences which are 

20 separated by spacers (e.g., linkers), and thus may be represented by the 
formula: (NNK)y - (abc) n - (NNK) Z where N and K are as defined previously 
(note that S as defined previously may be substituted for K), and y+z is 
equal to about 5, 6, 7, 8, or more, a, b and c represent the same or different 
nucleotides comprising a codon encoding spacer amino acids, n is up to 

25 about 3, 6, 9, or 12 amino acids, or more. The spacer residues may be 
somewhat flexible, comprising oligo-glycine, or oligo-glycine-glycine-serine, 
for example, to provide the diversity domains of the library with the ability to 
interact with sites in a large binding site relatively unconstrained by 
attachment to the phage protein. Rigid spacers, such as, e.g., oligo-proline, 

30 may also be inserted separately or in combination with other spacers, 
including glycine spacers. It may be desired to have the IR or IGF-1R 
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binding sequences close to one another and use a spacer to orient the 
binding sequences with respect to each other, such as by employing a turn 
between the two sequences, as might be provided by a spacer of the 
sequence glycine-proline-glycine, for example. To add stability to such a 
5 turn, it may be desirable or necessary to add cysteine residues at either or 
both ends of each variable region. The cysteine residues would then form 
disulfide bridges to hold the variable regions together in a loop, and in this 
fashion may also serve to mimic a cyclic peptide. Of course, those skilled in 
the art will appreciate that various other types of covalent linkages for 

10 cyclization may also be used. 

Spacer residues as described above may also be situated on either 
or both ends of the IR or IGF-1 R binding sequences. For instance, a cyclic 
peptide may be designed without an intervening spacer, by having a 
cysteine residue on both ends of the peptide. As described above, flexible 

15 spacers, e.g., oligo-glycine, may facilitate interaction of the peptide with the 
selected receptors. Alternatively, rigid spacers may allow the peptide to be 
presented as if on the end of a rigid arm, where the number of residues, 
e.g., proline residues, determines not only the length of the arm but also the 
direction for the arm in which the peptide is oriented. Hydrophilic spacers, 

20 made up of charged and/or uncharged hydrophilic amino acids, (e.g., Thr, 
His, Asn, Gin, Arg, Glu, Asp, Met, Lys, etc.), or hydrophobic spacers of 
hydrophobic amino acids (e.g., Phe, Leu, lie, Gly, Val, Ala, etc.) may be 
used to present the peptides to receptor binding sites with a variety of local 
environments. 

25 Notably, some peptides, because of their size and/or sequence, may 

cause severe defects in the infectivity of their carrier phage. This causes a 
loss of phage from the population during reinfection and amplification 
following each cycle of panning. To minimize problems associated with 
defective infectivity, DNA prepared from the eluted phage can be 

30 transformed into appropriate host cells, such as, e.g., E. co//, preferably by 
electroporation (see, e.g., Dower et a/., Nucl. Acids Res. 16:6127-6145), or 
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well-known chemical means. The cells are cultivated for a period of time 
sufficient for marker expression, and selection is applied as typically done 
for DNA transformation. The colonies are amplified, and phage harvested 
for affinity enrichment in accordance with established methods. Phage 
5 identified in the affinity enrichment may be re-amplified by infection into the 
host cells. The successful transformants are selected by growth in an 
appropriate antibiotic(s), e.g., tetracycline or ampicillin. This may be done 
on solid or in liquid growth medium. 

For growth on solid medium, the cells are grown at a high density 

10 (about 10 8 to 10 9 transformants per m 2 ) on a large surface of, for example, 
L-agar containing the selective antibiotic to form essentially a confluent 
lawn. The cells and extruded phage are scraped from the surface and 
phage are prepared for the first round of panning (see, e.g., Parmley and 
Smith, 1988, Gene 73:305-318). For growth in liquid culture, cells may be 

15 grown in L-broth and antibiotic through about 10 or more doublings. The 
phage are harvested by standard procedures (see Sambrook et aL % 1989, 
Molecular Cloning, 2 nd ed.). Growth in liquid culture may be more 
convenient because of the size of the libraries, while growth on solid media 
likely provides less chance of bias during the amplification process. 

20 For affinity enrichment of desired clones, generally about 10 3 to 10 4 

library equivalents (a library equivalent is one of each recombinant; 10 4 
equivalents of a library of 10 9 members is 10 9 x 10 4 = 10 13 phage), but 
typically at least 10 2 library equivalents, up to about 10 5 to 10 6 , are 
incubated with a receptor (or portion thereof) to which the desired peptide is 

25 sought. The receptor is in one of several forms appropriate for affinity 
enrichment schemes. In one example the receptor is immobilized on a 
surface or particle, and the library of phage bearing peptides is then panned 
on the immobilized receptor generally according to procedures known in the 
art. In an alternate scheme, a receptor is attached to a recognizable ligand 

30 (which may be attached via a tether). A specific example of such a ligand is 
biotin. The receptor, so modified, is incubated with the library of phage and 
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binding occurs with both reactants in solution. The resulting complexes are 
then bound to streptavidin (or avidin) through the biotin moiety. The 
streptavidin may be immobilized on a surface such as a plastic plate or on 
particles, in which case the complexes 

5 (phage/peptide/receptor/biotin/streptavidin) are physically retained; or the 
streptavidin may be labeled, with a fluorophor, for example, to tag the active 
phage/peptide for detection and/or isolation by sorting procedures, e.g., on a 
fluorescence-activated cell sorter. 

Phage that associate with IR or IGF-1R via non-specific interactions 

10 are removed by washing. The degree and stringency of washing required 
will be determined for each receptor/peptide of interest. A certain degree of 
control can be exerted over the binding characteristics of the peptides 
recovered by adjusting the conditions of the binding incubation and the 
subsequent washing. The temperature, pH, ionic strength, divalent cation 

15 concentration, and the volume and duration of the washing will select for 
peptides within particular ranges of affinity for the receptor. Selection based 
on slow dissociation rate, which is usually predictive of high affinity, is the 
most practical route. This may be done either by continued incubation in the 
presence of a saturating amount of free ligand, or by increasing the volume, 

20 number, and length of the washes. In each case, the rebinding of 
dissociated peptide-phage is prevented, and with increasing time, peptide- 
phage of higher and higher affinity are recovered. Additional modifications 
of the binding and washing procedures may be applied to find peptides that 
bind receptors under special conditions. Once a peptide sequence that 

25 imparts some affinity and specificity for the receptor molecule is known, the 
diversity around this binding motif may be embellished. For instance, 
variable peptide regions may be placed on one or both ends of the identified 
sequence. The known sequence may be identified from the literature, or 
may be derived from early rounds of panning in the context of the present 

30 invention. 
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G. Screening Assays 

In another embodiment of this invention, screening assays to identify 
pharmacologically active ligands at IR and/or IGF-1R are provided. Ligands 
may encompass numerous chemical classes, though typically they are 
5 organic molecules, preferably small organic compounds having a molecular 
weight of more than 50 and less than about 2,500 daltons. Such ligands 
can comprise functional groups necessary for structural interaction with 
proteins, particularly hydrogen bonding, and typically include at least an 
amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the 

10 functional chemical groups. Ligands often comprise cyclical carbon or 
heterocyclic structures and/or aromatic or polyaromatic structures 
substituted with one or more of the above functional groups. Ligands can 
also comprise biomolecules including peptides, saccharides, fatty acids, 
steroids, purines, pyrimidines, derivatives, structural analogs, or 

1 5 combinations thereof. 

Ligands may include, for example, 1) peptides such as soluble 
peptides, including Ig-tailed fusion peptides and members of random peptide 
libraries (see, e.g., Lam et a/., 1991, Nature 354:82-84; Houghten et a/., 
1991, Nature 354:84-86) and combinatorial chemistry-derived molecular 

20 libraries made of D- and/or L-configuration amino acids; 2) phosphopeptides 
(e.g., members of random and partially degenerate, directed 
phosphopeptide libraries, see, e.g., Songyang et a/., 1993, Cell 72:767-778); 
3) antibodies (e.g., polyclonal, monoclonal, humanized, anti-idiotypic, 
chimeric, and single chain antibodies as well as Fab, F(ab') 2 , Fab 

25 expression library fragments, and epitope-binding fragments of antibodies); 
and 4) small organic and inorganic molecules. 

Ligands can be obtained from a wide variety of sources including 
libraries of synthetic or natural compounds. Synthetic compound libraries 
are commercially available from, for example, Maybridge Chemical Co. 

30 (Trevillet, Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates 
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(Merrimack, NH), and Microsource (New Milford, CT). A rare chemical 
library is available from Aldrich Chemical Company, Inc. (Milwaukee, Wl). 
Natural compound libraries comprising bacterial, fungal, plant or animal 
extracts are available from, for example, Pan Laboratories (Bothell, WA). In 
5 addition, numerous means are available for random and directed synthesis 
of a wide variety of organic compounds and biomolecules, including 
expression of randomized oligonucleotides. 

Alternatively, libraries of natural compounds in the form of bacterial, 
fungal, plant and animal extracts can be readily produced. Methods for the 
10 synthesis of molecular libraries are readily available (see, e.g., DeWitt et a/., 
1993, Proc. Natl. Acad. Sci. USA 90:6909; Erb ef a/., 1994, Proc. Natl. Acad. 
Sci. USA 91:11422; Zuckermann et a/., 1994, J. Med. Chem. 37:2678; Cho 
et a/., 1993, Science 261:1303; Carell et a/., 1994, Angew. Chem. Int. Ed. 
Engl. 33:2059; Carell et al. t 1994, Angew. Chem. Int. Ed. Engl. 33:2061; and 
15 in Gallop et a/., 1994, J. Med. Chem. 37:1233). In addition, natural or 
synthetic compound libraries and compounds can be readily modified 
through conventional chemical, physical and biochemical means (see, e.g., 
Blondelle et a/., 1996, Trends in Biotech. 14:60), and may be used to 
produce combinatorial libraries. In another approach, previously identified 
20 pharmacological agents can be subjected to directed or random chemical 
modifications, such as acylation, alkylation, esterification, amidification, and 
the analogs can be screened for IR-modulating activity. 

Numerous methods for producing combinatorial libraries are known in 
the art, including those involving biological libraries; spatially addressable 
25 parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; the 'one-bead one-compound' library method; and 
synthetic library methods using affinity chromatography selection. The 
biological library approach is limited to polypeptide or peptide libraries, while 
the other four approaches are applicable to polypeptide, peptide, non- 
30 peptide oligomer, or small molecule libraries of compounds (K. S. Lam, 
1997, Anticancer Drug Des. 12:145). 
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Libraries may be screened in solution by methods generally known in 
the art for determining whether ligands competitively bind at a common 
binding site. Such methods may including screening libraries in solution 
(e.g., Houghten, 1992, Biotechniques 13:412-421), or on beads (Lam, 1991, 
5 Nature 354:82-84), chips (Fodor, 1993, Nature 364:555-556), bacteria or 
spores (Ladner U.S. Pat. No. 5,223,409), plasmids (Cull et a/., 1992, Proc. 
Natl. Acad. Sci. USA 89:1865-1869), or on phage (Scott and Smith, 1990, 
Science 249:386-390; Devlin, 1990, Science 249:404-406; Cwirla et a/., 
1990, Proc. Natl. Acad. Sci. USA 97:6378-6382; Felici, 1991, J. Mol. Biol. 

10 222:301-310; Ladner, supra). 

Where the screening assay is a binding assay, IR, or one of the IR- 
binding peptides disclosed herein, may be joined to a label, where the label 
can directly or indirectly provide a detectable signal. Various labels include 
radioisotopes, fluorescent molecules, chemiluminescent molecules, 

15 enzymes, specific binding molecules, particles, e.g., magnetic particles, and 
the like. Specific binding molecules include pairs, such as biotin and 
streptavidin, digoxin and antidigoxin, etc. For the specific binding members, 
the complementary member would normally be labeled with a molecule that 
provides for detection, in accordance with known procedures. 

20 A variety of other reagents may be included in the screening assay. 

These include reagents like salts, neutral proteins, e.g., albumin, detergents, 
etc., which are used to facilitate optimal protein-protein binding and/or 
reduce non-specific or background interactions. Reagents that improve the 
efficiency of the assay, such as protease inhibitors, nuclease inhibitors, anti- 

25 microbial agents, etc., may be used. The components are added in any 
order that produces the requisite binding. Incubations are performed at any 
temperature that facilitates optimal activity, typically between 4° and 40°C. 
Incubation periods are selected for optimum activity, but may also be 
optimized to facilitate rapid high-throughput screening. Normally, between 

30 0.1 and 1 h will be sufficient. In general, a plurality of assay mixtures is run 
in parallel with different test agent concentrations to obtain a differential 
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response to these concentrations. Typically, one of these concentrations 
serves as a negative control, i.e., at zero concentration or below the level of 
detection. 

The screening assays provided in accordance with this invention are 
5 based on those disclosed in International application WO 96/04557, which is 
incorporated herein in its entirety. Briefly, WO 96/04557 discloses the use 
of reporter peptides that bind to active sites on targets and possess agonist 
or antagonist activity at the target. These reporters are identified from 
recombinant libraries and are either peptides with random amino acid 

10 sequences or variable antibody regions with at least one CDR region that 
has been randomized (rVab). The reporter peptides may be expressed in 
cell recombinant expression systems, such as for example in E. co//, or by 
phage display (see WO 96/04557 and Kay et al. 1996, Mol. Divers. 
1(2): 139-40, both of which are incorporated herein by reference). The 

1 5 reporters identified from the libraries may then be used in accordance with 
this invention either as therapeutics themselves, or in competition binding 
assays to screen for other molecules, preferably small, active molecules, 
which possess similar properties to the reporters and may be developed as 
drug candidates to provide agonist or antagonist activity. Preferably, these 

20 small organic molecules are orally active. 

The basic format of an in vitro competitive receptor binding assay as 
the basis of a heterogeneous screen for small organic molecular 
replacements for insulin may be as follows: occupation of the active site of 
IR is quantified by time-resolved fluorometric detection (TRFD) with 

25 streptavidin-labeled europium (saEu) complexed to biotinylated peptides 
(bP). In this assay, saEu forms a ternary complex with bP and IR (i.e., 
IR:bP:saEu complex). The TRFD assay format is well-established, 
sensitive, and quantitative (Tompkins et al., 1993, J. Immunol. Methods 
163:209-216). The assay can use a single-chain antibody or a biotinylated 

30 peptide. Furthermore, both assay formats faithfully report the competition of 
the biotinylated ligands binding to the active site of IR by insulin. 
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In these assays, soluble IR is coated on the surface of microtiter 
wells, blocked by a solution of 0.5% bovine serum albumin (BSA) and 2% 
non-fat milk in PBS, and then incubated with biotinylated peptide or rVab. 
Unbound bP is then washed away and saEu is added to complex with 
5 receptor-bound bP. Upon addition of the acidic enhancement solution, the 
bound europium is released as free Eu 3+ which rapidly forms a highly 
fluorescent and stable complex with components of the enhancement 
solution. The IR:bP bound saEu is then converted into its highly fluorescent 
state and detected by a detector such as Wallac Victor II (EG&G Wallac, 
10 Inc.) 

Phage display libraries can also be screened for ligands that bind to 
IR or IGF-1R, as described above. Details of the construction and analyses 
of these libraries, as well as the basic procedures for biopanning and 
selection of binders, have been published (see, e.g., WO 96/04557; 

15 Mandecki et a/., 1997, Display Technologies - Novel Targets and 
Strategies, P. Guttry (ed), International Business Communications, Inc. 
Southborogh, MA, pp. 231-254; Ravera et a/., 1998, Oncogene 16:1993- 
1999; Scott and Smith, 1990, Science 249:386-390); Grihalde et a/., 1995, 
Gene 166:187-195; Chen et a/., 1996, Proc. Natl. Acad. Sci. USA 93:1997- 

20 2001; Kay et a/., 1993, Gene 128:59-65; Carcamo et al. 9 1998, Proc. Natl. 
Acad. Sci. USA 95:11146-11151; Hoogenboom, 1997, Trends Biotechnol. 
15:62-70; Rader and Barbas, 1997, Curr. Opin. Biotechnol. 8:503-508; all of 
which are incorporated herein by reference). 

The designing of mimetics to a known pharmaceutical^ active 

25 compound is a known approach to the development of pharmaceuticals 
based on a "lead" compound. This might be desirable where the active 
compound is difficult or expensive to synthesize or where it is unsuitable for 
a particular method of administration, e.g., peptides are generally unsuitable 
active agents for oral compositions as they tend to be quickly degraded by 

30 proteases in the alimentary canal. Mimetic design, synthesis, and testing 
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are generally used to avoid large-scale screening of molecules for a target 
property. 

There are several steps commonly taken in the design of a mimetic 
from a compound having a given target property. First, the particular parts 
5 of the compound that are critical and/or important in determining the target 
property are determined. In the case of a peptide, this can be done by 
systematically varying the amino acid residues in the peptide (e.g., by 
substituting each residue in turn). These parts or residues constituting the 
active region of the compound are known as its "pharmacophore". 

10 Once the pharmacophore has been found, its structure is modeled 

according to its physical properties (e.g., stereochemistry, bonding, size, 
and/or charge), using data from a range of sources (e.g., spectroscopic 
techniques, X-ray diffraction data, and NMR). Computational analysis, 
similarity mapping (which models the charge and/or volume of a 

15 pharmacophore, rather than the bonding between atoms), and other 
techniques can be used in this modeling process. 

In a variant of this approach, the three dimensional structure of the 
ligand and its binding partner are modeled. This can be especially useful 
where the ligand and/or binding partner change conformation on binding, 

20 allowing the model to take account of this in the design of the mimetic. 

A template molecule is then selected, and chemical groups that 
mimic the pharmacophore can be grafted onto the template. The template 
molecule and the chemical groups grafted on to it can conveniently be 
selected so that the mimetic is easy to synthesize, is likely to be 

25 pharmacologically acceptable, does not degrade in vivo, and retains the 
biological activity of the lead compound. The mimetics found are then 
screened to ascertain the extent they exhibit the target property, or to what 
extent they inhibit it. Further optimization or modification can then be carried 
out to arrive at one or more final mimetics for in vivo or clinical testing. 

30 This invention provides specific IR and IGF-1 R amino acid sequences 

that function as either agonists or antagonists at IR and/or IGF-1 R. 
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Additional sequences may be obtained in accordance with the procedures 
described herein. 

H. Use of the Peptides Provided by this Invention 

The IR and IGF-1R agonist and antagonist peptides provided by this 
5 invention are useful as lead compounds for identifying other more potent or 
selective therapeutics, assay reagents for identifying other useful ligands by, 
for example, competition screening assays, as research tools for further 
analysis of IR and IGF-1R, and as potential therapeutics in pharmaceutical 
compositions. In one embodiment, one or more of the disclosed peptides 

10 can be provided as components in a kit for identifying other ligands (e.g., 
small, organic molecules) that bind to IR or IGF-1R. Such kits may also 
comprise IR or IGF-1R, or functional fragments thereof. The peptide and 
receptor components of the kit may be labeled (e.g., by radioisotopes, 
fluorescent molecules, chemiluminescent molecules, enzymes or other 

15 labels), or may be unlabeled and labeling reagents may be provided. The 
kits may also contain peripheral reagents such as buffers, stabilizers, etc. 
Instructions for use can also be provided. 

In another embodiment, the peptide sequences provided by this 
invention can be used to design secondary peptide libraries, which are 

20 derived from the peptide sequences, and include members that bind to Site 
1 and/or Site 2 of IR or IGF-1R. Such libraries can be used to identify 
sequence variants that increase or otherwise modulate the binding and/or 
activity of the original peptide at IR or IGF-1R, as described in the related 
applications of Beasley et a/. International Application PCT/US00/08528, 

25 filed March 29, 2000, and Beasley et a/., U.S. Application Serial No. 
09/538,038, filed March 29, 2000, in accordance with well-established 
techniques. 

IR agonist amino acid sequences provided by this invention are 
useful as insulin analogs and may therefore be developed as treatments for 
30 diabetes or other diseases associated with a decreased response or 
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production of insulin. For use as an insulin supplement or replacement, 
amino acid sequences include D117/H2C: FHENFYDWFVRQVSK (SEQ ID 
NO:1780); D117/H2C minus terminal lysine: FHENFYDWFVRQVS (SEQ ID 
NO:1557); D118: DYKDFYDAIQLVRSARAGGTRDKK (SEQ ID NO:1781); 
5 D118 minus FLAG® tag and terminal lysines: FYDAIQLVRSARAGGTRD 
(SEQ ID NO:1782); D119: KDRAFYNGLRDLVGAVYGAWDKK (SEQ ID 
NO: 1733); D119 minus terminal lysines: KDRAFYNGLRDLVGAVYGAWD 
(residues 1-21 of SEQ ID NO:1733); D116/JBA5: 
DYKDLCQSWGVRIGWLAGLCPKK (SEQ ID NO:1541); D116/JBA5 minus 

10 FLAG® tag and terminal lysines: LCQSWGVRIGWLAGLCP (SEQ ID 
NO:1542); D113/H2: DYKDVTFTSAVFH E N FYDW FVRQVSKK (SEQ ID 
NO: 1783); D113/H2 minus FLAG® tag and terminal lysines: 
VTFTS A V F H E N FYDW F VRQ VS (SEQ ID NO:1784); and S175: 
GRVDWLQRNANFYDWFVAELG (SEQ ID NO:1560). Preferred peptide 

15 dimer sequences are represented by S325, S332, S333, S335, S337, S353, 
S374-S376, S378, S379, S381, S414, S415, and S418 (see Table 7). Other 
preferred dimers sequences are represented by S455, S457, S458, S467, 
S468, S471, S499, S510, S518, S519, and S520 sequences (see Table 7). 
Especially preferred is the S519 dimer sequence, which shows in vitro and 

20 in vivo activity comparable to insulin (see Figures 31 A-C, 32A-B, and 33). 

IGF-1 R antagonist amino acid sequences provided by this invention 
are useful as treatments for cancers, including, but not limited to, breast, 
prostate, colorectal, and ovarian cancers. Human and breast cancers are 
responsible for over 40,000 deaths per year, as present treatments such as 

25 surgery, chemotherapy, radiation therapy, and immunotherapy show limited 
success. The IGF-1 R antagonist amino acid sequences disclosed herein 
are also useful for the treatment or prevention of diabetic retinopathy. 
Recent reports have shown that a previously identified IGF-1 R antagonist 
can suppress retinal neovascularization, which causes diabetic retinopathy 

30 (Smith ef a/., 1999, Nat. Med. 5:1390-1395). Preferred IGF-1R antagonist 
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amino acid sequences include those comprising the sequences of RP33- 
IGF and RP33K-IGF (Tables 24-26). 

IGF-1R agonist amino acid sequences provided by this invention are 
useful for development as treatments for neurological disorders, including 
5 stroke and diabetic neuropathy. Reports of several different groups 
implicate IGF-1R in the reduction of global brain ischemia, and support the 
use of IGF-1 for the treatment of diabetic neuropathy (reviewed in Auer et 
a/., 1998, Neurology 51:S39-S43; Apfel, 1999, Am. J. Med. 107:34S-42S). 
The IGF-1 R agonist peptides of the invention may be useful for enhancing 
10 the survival of cells and/or blocking apoptosis in cells. Preferred IGF-1 R 
agonist amino acid sequences include those comprising the sequences of 
G33, RP48, RP60, and RP30-IGF-12-RP31-IGF (Tables 27-29). 

I. Methods of Administration 

The amino acid sequences of this invention may be administered as 
15 pharmaceutical compositions comprising standard carriers known in the art 
for delivering proteins and peptides and by gene therapy. Preferably, a 
pharmaceutical composition includes, in admixture, a pharmaceutical^ (i.e., 
physiologically) acceptable carrier, excipient, or diluent, and one or more of 
an IR or IGF-1 R agonist or antagonist peptide, as an active ingredient. The 
20 preparation of pharmaceutical compositions that contain peptides as active 
ingredients is well understood in the art. Typically, such compositions are 
prepared as injectables, either as liquid solutions or suspensions, however, 
solid forms suitable for solution in, or suspension in, liquid prior to injection 
can also be prepared. The preparation can also be emulsified. The active 
25 therapeutic ingredient is often mixed with excipients that are 
pharmaceutical^ (i.e., physiologically) acceptable and compatible with the 
active ingredient. Suitable excipients are, for example, water, saline, 
dextrose, glycerol, ethanol, or the like and combinations thereof. In addition, 
if desired, the composition can contain minor amounts of auxiliary 
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substances such as wetting or emulsifying agents, pH-buffering agents, 
which enhance the effectiveness of the active ingredient. 

An IR or IGF-1R agonist or antagonist peptide can be formulated into 
a pharmaceutical composition as neutralized physiologically acceptable salt 
5 forms. Suitable salts include the acid addition salts (i.e., formed with the 
free amino groups of the peptide molecule) and which are formed with 
inorganic acids such as, for example, hydrochloric or phosphoric acids, or 
such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts 
formed from the free carboxyl groups can also be derived from inorganic 

10 bases such as, for example, sodium, potassium, ammonium, calcium, or 
ferric hydroxides, and such organic bases as isopropylamine, 
trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like. 

The pharmaceutical compositions can be administered systemically 
by oral or parenteral routes. Non-limiting parenteral routes of administration 

15 include subcutaneous, intramuscular, intraperitoneal, intravenous, 
transdermal, inhalation, intranasal, intra-arterial, intrathecal, enteral, 
sublingual, or rectal. Due to the labile nature of the amino acid sequences 
parenteral administration is preferred. Preferred modes of administration 
include aerosols for nasal or bronchial absorption; suspensions for 

20 intravenous, intramuscular, intrasternal or subcutaneous, injection; and 
compounds for oral administration. 

Intravenous administration, for example, can be performed by 
injection of a unit dose. The term "unit dose" when used in reference to a 
pharmaceutical composition of the present invention refers to physically 

25 discrete units suitable as unitary dosage for humans, each unit containing a 
predetermined quantity of active material calculated to produce the desired 
therapeutic effect in association with the required diluent; i.e., liquid used to 
dilute a concentrated or pure substance (either liquid or solid), making that 
substance the correct (diluted) concentration for use. For injectable 

30 administration, the composition is in sterile solution or suspension or may be 
emulsified in pharmaceutically- and physiologically-acceptable aqueous or 
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oleaginous vehicles, which may contain preservatives, stabilizers, and 
material for rendering the solution or suspension isotonic with body fluids 
(i.e., blood) of the recipient. 

Excipients suitable for use are water, phosphate buffered saline, pH 
5 7.4, 0.15 M aqueous sodium chloride solution, dextrose, glycerol, dilute 
ethanol, and the like, and mixtures thereof. Illustrative stabilizers are 
polyethylene glycol, proteins, saccharides, amino acids, inorganic acids, and 
organic acids, which may be used either on their own or as admixtures. The 
amounts or quantities, as well as routes of administration, used are 

10 determined on an individual basis, and correspond to the amounts used in 
similar types of applications or indications known to those of skill in the art. 

Pharmaceutical compositions are administered in a manner 
compatible with the dosage formulation, and in a therapeutically effective 
amount. The quantity to be administered depends on the subject to be 

15 treated, capacity of the subject's immune system to utilize the active 
ingredient, and degree of modulation of IR or IGF-1R activity desired. 
Precise amounts of active ingredient required to be administered depend on 
the judgment of the practitioner and are specific for each individual. 
However, suitable dosages may range from about 10 to 200 nmol active 

20 peptide per kilogram body weight of individual per day and depend on the 
route of administration. Suitable regimes for initial administration and 
booster shots are also variable, but are typified by an initial administration 
followed by repeated doses at one or more hour intervals by a subsequent 
injection or other administration. Alternatively, continuous intravenous 

25 infusions sufficient to maintain picomolar concentrations (e.g., approximately 
1 pM to approximately 10 nM) in the blood are contemplated. An exemplary 
formulation comprises the IR or IGF-1R agonist or antagonist peptide in a 
mixture with sodium busulfite USP (3.2 mg/ml); disodium edetate USP (0.1 
mg/ml); and water for injection q.s.a.d. (1 ml). 

30 Further guidance in preparing pharmaceutical formulations can be 

found in, e.g., Gilman et al. (eds), 1990, Goodman and Gilman's: The 
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Pharmacological Basis of Therapeutics, 8th ed., Pergamon Press; and 
Remington's Pharmaceutical Sciences, 17th ed., 1990, Mack Publishing 
Co., Easton, PA; Avis et ai (eds), 1993, Pharmaceutical Dosage Forms: 
Parenteral Medications, Dekker, New York; Lieberman et al. (eds), 1990, 
5 Pharmaceutical Dosage Forms: Disperse Systems, Dekker, New York. 

The present invention further contemplates compositions comprising 
an IR or IGF-1R agonist or antagonist peptide, and a physiologically 
acceptable carrier, excipient, or diluent as described in detail herein. 

The constructs as described herein may also be used in gene 

10 transfer and gene therapy methods to allow the expression of one or more 
amino acid sequences of the present invention. The amino acid sequences 
of the present invention can be used for gene therapy and thereby provide 
an alternative method of treating diabetes which does not rely on the 
administration or expression of insulin. Expressing insulin for use in gene 

15 therapy requires the expression of a precursor product, which must then 
undergo processing including cleavage and disulfide bond formation to form 
the active product. The amino acid sequences of this invention, which 
possess activity, are relatively small, and thus do not require the complex 
processing steps to become active. Accordingly, these sequences provide a 

20 more suitable product for gene therapy. 

Gene transfer systems known in the art may be useful in the practice 
of the gene therapy methods of the present invention. These include viral 
and non-viral transfer methods. A number of viruses have been used as 
gene transfer vectors, including polyoma, i.e., SV40 (Madzak et ai, 1992, J. 

25 Gen. Virol. , 73:1533-1536), adenovirus (Berkner, 1992, Curr. Top. Microbiol. 
Immunol., 158:39-6; Berkner et ai, 1988, Bio Techniques, 6:616-629; 
Gorziglia et al., 1992, J. Virol., 66:4407-4412; Quantin et ai, 1992, Proc. 
Natl. Acad. Sci. USA, 89:2581-2584; Rosenfeld et al., 1992, Cell, 68:143- 
155; Wilkinson et al., 1992, Nucl. Acids Res., 20:2233-2239; Stratford- 

30 Perricaudet et ai, 1990, Hum. Gene Then, 1:241-256), vaccinia virus 
(Mackett et al., 1992, Biotechnology, 24:495- 499), adeno-associated virus 
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(Muzyczka, 1992, Curr. Top. Microbiol. Immunol. 158:91- 123; Ohi et al., 
1990, Gene, 89:279-282), herpes viruses including HSV and EBV 
(Margolskee, 1992, Curr. Top. Microbiol. Immunol. 158:67-90; Johnson et 
a/., 1992, J. Virol., 66:2952-2965; Fink et a/., 1992, Hum. Gene Ther. 3:11- 
5 19; Breakfield et al., 1987, Mol. Neurobiol., 1:337-371; Fresse et at., 1990, 
Biochem. Pharmacol. 40:2189-2199), and retroviruses of avian 
(Brandyopadhyay et al., 1984, Mol. Cell Biol., 4:749-754; Petropouplos et 
al., 1992, J. Virol., 66:3391-3397), murine (Miller, 1992, Curr. Top. Microbiol. 
Immunol. 158:1-24; Miller et al., 1985, Mol. Cell Biol., 5:431-437; Sorge et 

10 al., 1984, Mol. Cell Biol., 4:1730-1737; Mann et al., 1985, J. Virol., 54:401- 
407), and human origin (Page et al., 1990, J. Virol., 64:5370-5276; 
Buchschalcher et al., 1992, J. Virol., 66:2731-2739). Most human gene 
therapy protocols have been based on disabled murine retroviruses. 

Non-viral gene transfer methods known in the art include chemical 

15 techniques such as calcium phosphate coprecipitation (Graham etal., 1973, 
Virology, 52:456-467; Pellicer et al., 1980, Science, 209:1414-1422), 
mechanical techniques, for example microinjection (Anderson et al., 1980, 
Proc. Natl. Acad. Sci. USA, 77:5399-5403; Gordon era/., 1980, Proc. Natl. 
Acad. Sci. USA, 77:7380-7384; Brinster et al., 1981, Cell, 27:223-231; 

20 Constantini et al., 1981, Nature, 294:92-94), membrane fusion-mediated 
transfer via liposomes (Feigner et al., 1987, Proc. Natl. Acad. Sci. USA, 
84:7413-7417; Wang et al., 1989, Biochemistry, 28:9508-9514; Kaneda et 
al., 1989, J. Biol. Chem., 264:12126-12129; Stewart etal., 1992, Hum. Gene 
Ther. 3:267-275; Nabel et al., 1990, Science, 249:1285-1288; Lim et al., 

25 1992, Circulation, 83:2007-2011; U.S. Patent Nos. 5,283,185 and 
5,795,587), and direct DNA uptake and receptor-mediated DNA transfer 
(Wolff et al., 1990, Science, 247:1465-1468; Wu et al., 1991, 
BioTechniques, 11:474-485; Zenke et al., 1990, Proc. Natl. Acad. Sci. USA, 
87:3655-3659; Wu et al., 1989, J. Biol. Chem., 264:16985-16987; Wolff et 

30 al., 1991, BioTechniques, 11:474-485; Wagner et al., 1991, Proc. Natl. 
Acad. Sci. USA, 88:4255-4259; Cotten et al., 1990, Proc. Natl. Acad. Sci. 



WO 03/027246 PCT/US02/30412 



-89- 

USA f 87:4033-4037; Curiel et a/., 1991, Proc. Natl. Acad. Sci. USA, 
88:8850-8854; Curiel et a/., 1991, Hum. Gene Ther. 3:147-154). 

Many types of cells and cell lines (e.g., primary cell lines or 
established cell lines) and tissues are capable of being stably transfected by 
5 or receiving the constructs of the invention. Examples of cells that may be 
used include, but are not limited to, stem cells, B lymphocytes, T 
lymphocytes, macrophages, other white blood lymphocytes (e.g., 
myelocytes, macrophages, or monocytes), immune system cells of different 
developmental stages, erythroid lineage cells, pancreatic cells, lung cells, 

10 muscle cells, liver cells, fat cells, neuronal cells, glial cells, other brain cells, 
transformed cells of various cell lineages corresponding to normal cell 
counterparts (e.g., K562, HEL, HL60, and MEL cells), and established or 
otherwise transformed cells lines derived from all of the foregoing. In 
addition, the constructs of the present invention may be transferred by 

15 various means directly into tissues, where they would stably integrate into 
the cells comprising the tissues. Further, the constructs containing the DNA 
sequences of the peptides of the invention can be introduced into primary 
cells at various stages of development, including the embryonic and fetal 
stages, so as to effect gene therapy at early stages of development. 

20 In one approach, plasmid DNA is complexed with a polylysine- 

conjugated antibody specific to the adenovirus hexon protein, and the 
resulting complex is bound to an adenovirus vector. The trimolecular 
complex is then used to infect cells. The adenovirus vector permits efficient 
binding, internalization, and degradation of the endosome before the 

25 coupled DNA is damaged. 

In another approach, liposome/DNA is used to mediate direct in vivo 
gene transfer. While in standard liposome preparations the gene transfer 
process is non-specific, localized in vivo uptake and expression have been 
reported in tumor deposits, for example, following direct in situ 

30 administration (Nabel, 1992, Hum. Gene Ther. 3:399-410). 
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Suitable gene transfer vectors possess a promoter sequence, 
preferably a promoter that is cell-specific and placed upstream of the 
sequence to be expressed. The vectors may also contain, optionally, one or 
more expressible marker genes for expression as an indication of successful 
5 transfection and expression of the nucleic acid sequences contained in the 
vector. In addition, vectors can be optimized to minimize undesired 
immunogenicity and maximize long-term expression of the desired gene 
product(s) (see Nabe, 1999, Proc. Natl. Acad. Sci. USA 96:324-326). 
Moreover, vectors can be chosen based on cell-type that is targeted for 
1 0 treatment. 

Illustrative examples of vehicles or vector constructs for transfection 
or infection of the host cells include replication-defective viral vectors, DNA 
virus or RNA virus (retrovirus) vectors, such as adenovirus, herpes simplex 
virus and adeno-associated viral vectors. Adeno-associated virus vectors 

15 are single stranded and allow the efficient delivery of multiple copies of 
nucleic acid to the cell's nucleus. Preferred are adenovirus vectors. The 
vectors will normally be substantially free of any prokaryotic DNA and may 
comprise a number of different functional nucleic acid sequences. An 
example of such functional sequences may be a DNA region comprising 

20 transcriptional and translational initiation and termination regulatory 
sequences, including promoters (e.g., strong promoters, inducible 
promoters, and the like) and enhancers which are active in the host cells. 
Also included as part of the functional sequences is an open reading frame 
(polynucleotide sequence) encoding a protein of interest. Flanking 

25 sequences may also be included for site-directed integration. In some 
situations, the 5'-flanking sequence will allow homologous recombination, 
thus changing the nature of the transcriptional initiation region, so as to 
provide for inducible or non-inducible transcription to increase or decrease 
the level of transcription, as an example. 

30 In general, the encoded and expressed peptide may be intracellular, 

i.e., retained in the cytoplasm, nucleus, or in an organelle, or may be 
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secreted by the cell. For secretion, a signal sequence may be fused to the 
peptide sequence. As previously mentioned, a marker may be present for 
selection of cells containing the vector construct. The marker may be an 
inducible or non-inducible gene and will generally allow for positive selection 
5 under induction, or without induction, respectively. Examples of marker 
genes include neomycin, dihydrofolate reductase, glutamine synthetase, 
and the like. The vector employed will generally also include an origin of 
replication and other genes that are necessary for replication in the host 
cells, as routinely employed by those having skill in the art. As an example, 

10 the replication system comprising the origin of replication and any proteins 
associated with replication encoded by a particular virus may be included as 
part of the construct. The replication system must be selected so that the 
genes encoding products necessary for replication do not ultimately 
transform the cells. Such replication systems are represented by 

15 replication-defective adenovirus (see G. Acsadi et a/., 1994, Hum. MoL 
Genet. 3:579-584) and by Epstein-Barr virus. Examples of replication 
defective vectors, particularly, retroviral vectors that are replication 
defective, are BAG, (see Price et a/., 1987, Proc. Natl. Acad. Sci. USA, 
84:156; Sanes et a/., 1986, EMBO J., 5:3133). It will be understood that the 

20 final gene construct may contain one or more genes of interest, for example, 
a gene encoding a bioactive metabolic molecule. In addition, cDNA, 
synthetically produced DNA or chromosomal DNA may be employed 
utilizing methods and protocols known and practiced by those having skill in 
the art. 

25 According to one approach for gene therapy, a vector encoding an IR 

or IGF-1R agonist or antagonist peptide is directly injected into the recipient 
cells (in vivo gene therapy). Alternatively, cells from the intended recipients 
are explanted, genetically modified to encode an IR or IGF-1R agonist or 
antagonist peptide, and reimplanted into the donor (ex vivo gene therapy). 

30 An ex vivo approach provides the advantage of efficient viral gene transfer, 
which is superior to in vivo gene transfer approaches. In accordance with ex 
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vivo gene therapy, the host cells are first transfected with engineered 
vectors containing at least one gene encoding an IR or IGF-1R agonist or 
antagonist peptide, suspended in a physiologically acceptable carrier or 
excipient such as saline or phosphate buffered saline, and the like, and then 
5 administered to the host or host cells. The desired gene product is 
expressed by the injected cells, which thus introduce the gene product into 
the host. The introduced gene products can thereby be utilized to treat or 
ameliorate a disorder that is related to altered insulin or IGF-1 levels (e.g., 
diabetes). 

10 The described constructs may be administered in the form of a 

pharmaceutical preparation or composition containing a pharmaceutically 
acceptable carrier and a physiological excipient, in which preparation the 
vector may be a viral vector construct, or the like, to target the cells, tissues, 
or organs of the recipient organism of interest, including human and non- 

15 human mammals. The composition may be formed by dispersing the 
components in a suitable pharmaceutically acceptable liquid or solution such 
as sterile physiological saline or other injectable aqueous liquids. The 
amounts of the components to be used in such compositions may be 
routinely determined by those having skill in the art. The compositions may 

20 be administered by parenteral routes of injection, including subcutaneous, 
intravenous, intramuscular, and intrasternal. 

J. Cancer Therapeutics 

In recent experiments, embryo fibroblasts from IGF-1 R knock-out 
mice have been shown to be highly resistant to transformation by 

25 oncogenes such as SV40 T antigen, activated Ha-ras, activated Src, and 
others (B. Valentinis and R. Baserga, 2001, Mol. Pathol., 54:133-137). This 
suggested that IGF-1 R was required to mediate malignant transformation by 
these oncogenes. In addition, IGF-1 and IGF-1 R have been shown to act as 
transforming factors in various forms of human cancer (see above). IGF-1 

30 and IGF-2 have also been implicated as factors in the malignant 
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transformation of several tissues. Transgenic mice that express a truncated 
form of IGF-1 that has a decreased affinity for IGFBPs (des(1-3) IGF-11), 
show increased incidence of mammary tumors (Hadsell et a/., 2000, 
Oncogene 19:889-898). In addition, mice over-expressing IGF-11 in 
5 mammary glands showed increased mammary tumor formation (Bates et 
aL, 1995, Br. J. Cancer 72:1 189-1 193). Transgenic mice that overexpress 
IGF-1 in the basal layer of the skin show hyperplasia of the epidermis and 
increased promotion of spontaneous tumors (DiGiovanni et aL, 2000, 
Cancer Res. 60:1 561 -1 570). 

10 IGF-1 R also appears to cross-talk with other hormone receptors. 

Considerable evidence suggests that estrogen can act to increase 
expression of IGF-1 R. This is of particular importance in breast cancer, 
where the expression of IGF-1 R correlates with expression of the estrogen 
receptor (ER). IGF-1 R expression is higher in tumors from ER positive 

15 patients. Accordingly, IGF-1 R expression could be used as a prognostic 
marker for breast cancer patients. In addition, high levels of IRS-1, a key 
intermediate in the IGF-1 R signal transduction cascade, correlates with 
tumor size and shorter disease-free survival in patients with ER positive 
tumors (D. Sachdev and D. Yee, 2001, Endocr. Relat. Cancer 8:197-209). 

20 In addition, treatment with anti-estrogens has been shown to decrease the 
expression of IGF-1 R and IRS-1 (Chan et aL, 2001, Clin. Cancer Res. 
7:2545-2554). Thus, the cross-talk between IGF-1 R and ER may be 
complex. Yet, it is clear that IGF-signaling promotes malignant 
transformation in mammary glands. Interestingly, ER positive MCF-7 cells 

25 treated with IGF-1 show a sustained activation of the PI3K-Akt pathway and 
protection against apoptosis induced by serum deprivation. In contrast, ER 
negative MDA-MB 231 cells show only a transient activation of PI3K-AW 
pathway (Bartucci et aL, 2001, Cancer Res. 61:6747-6754). 

Studies have also revealed a connection between IGF-1 R-mediated 

30 signaling and epidermal growth factor (EGF)-induced signaling through 
ErbB-receptors. IGF-1 R and ErbB-2 (Neu/Her2) have been observed to 
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form hetero-oligomers induced by stimulation with heregulin or IGF-1 
(Balana et aL, 2001, Oncogene, 19:34-47, 2001). In glioblastomas, 
resistance to a chemical inhibitor of the ErbB receptor tyrosine kinase has 
been correlated with increased IGF-1 R expression and constitutive PI3K 
5 signaling (Chakravarti et aL, 2002, Cancer Res. 62:200-207). In breast 
cancer cell lines over expressing ErbB-2, increased IGF-1 R signaling was 
observed in the presence of the anti-ErbB-2 receptor monoclonal antibody 
Herceptin®/trastuzumab (Lu et aL, 2001, J. Natl. Cancer Inst 93:1852- 
1857). 

10 Modulation of IGF-signaling in various malignant cells has provided 

further evidence for the involvement of the IGF-1 R in cancer. Abrogation of 
IGF-1 R expression by antisense RNA reversed the transformed phenotype 
in cervical cancer cells. Antisense to IGF-1 R also inhibited glioblastoma and 
melanoma xenografts in nude mice (Resnicoff et aL, 1994, Cancer Res. 

15 54:4848-4850; Resnicoff et aL, 1994, Cancer Res. 54:2218-2222; Nakamura 
et aL, 2000, Cancer Res. 60:760-765, 2000). Experiments have also 
indicated that IGF-1 R is involved in the development and maintenance of 
metastatic phenotypes. In particular, high expression of a dominant 
negative mutant of IGF-1 R (486stop) in ER positive breast cancer cells has 

20 been shown to inhibit adhesion, invasion, and metastasis of the cells (Dunn 
et aL, 1998, Cancer Res. 58:3353-3361). Moreover, lung carcinoma cells 
exhibited an enhanced metastatic phenotype following overexpression of 
IGF-1R (Long et aL, 1998, Exp. Cell Res. 238:116-121). In addition, 
activation of IGF-1 R has been shown to block apoptotic pathways. 

25 Apoptosis in mammary glands was inhibited in IGF-1 transgenic mice 
(Hadsell et aL, 2000, Oncogene 19:889-898). Moreover, down-regulation of 
IGF-1 R function, either by antisense strategies or dominant negative 
mutants, caused massive apoptosis of tumor cells in vitro and in vivo. IGF-1 
has also been shown to inhibit apoptosis associated with transformation by 

30 the c-myc oncogene and apoptosis induced by chemotherapeutic agents. 
The anti-apoptotic signaling of IGF-1 has been attributed to the PI3K-Akt 
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pathway, although other pathways may mediate similar effects (Butt et a/. f 
1999, Immunol. Cell Biol. 77:256-262; B. Valentinis and R. Baserga, 2001, 
Mol. Pathol. 54:133-137). 

The sum of these observations indicate the importance of identifying 
5 antagonists or inhibitors of IGF-1 and/or IGF-1 R. Attempts have been made 
to develop clinically relevant inhibitors of IGF-1 R using monoclonal 
antibodies, antisense strategies, and peptide fragments derived from the 
natural ligand (Dunn et a/., 1998, Cancer Res. 58:3353-3361; Z. 
Pietrzkowski et a/., 1992, Cancer Res. 52:6447-6451; Z. Pietrzkowski et a/., 

10 1993, Cancer Res. 53:1102-1106; Rubini et a/., 1999, Exp. Cell Res. 
251 :22-32). Using an alternate approach, this invention provides methods, 
kits, and compositions (e.g., pharmaceutical compositions) comprising IGF- 
1R antagonist peptides, or small molecule mimetics thereof, that can be 
useful in the diagnosis, treatment, and monitoring of one or more cancers. 

15 In some cases, the compositions, methods, and kits of the invention can 
also be used to determine the prognosis of a IGF-related medical condition 
(e.g., cancer). Advantageously, certain IGF-1 R antagonist peptides 
disclosed herein are specific for Site 1 or Site 2 of the IGF-1 receptor. 

In accordance with the invention, non-limiting cancer types include 

20 carcinoma, sarcoma, myeloma, leukemia, and lymphoma, and mixed types 
of cancers, such as adenosquamous carcinoma, mixed mesodermal tumor, 
carcinosarcoma, and teratocarcinoma. Representative cancers include, but 
are not limited to, bladder cancer, lung cancer, breast cancer, colon cancer, 
rectal cancer, endometrial cancer, ovarian cancer, head and neck cancer, 

25 prostate cancer, and melanoma. Specifically included are AIDS-related 
cancers (e.g., Kaposi's Sarcoma, AIDS-related lymphoma), bone cancers 
(e.g., osteosarcoma, malignant fibrous histiocytoma of bone, Ewing's 
Sarcoma, and related cancers), and hematologic/blood cancers (e.g., adult 
acute lymphoblastic leukemia, childhood acute lymphoblastic leukemia, 

30 adult acute myeloid leukemia, childhood acute myeloid leukemia, chronic 
lymphocytic leukemia, chronic myelogenous leukemia, hairy cell leukemia, 
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cutaneous T-cell lymphoma, adult HodgkiiYs disease, childhood Hodgkin's 
disease, Hodgkin's disease during pregnancy, mycosis fungoides, adult non- 
Hodgkin's lymphoma, childhood non-Hodgkin's lymphoma, non-Hodgkin's 
lymphoma during pregnancy, primary central nervous system lymphoma, 
5 Sezary syndrome, cutaneous T-cell lymphoma, Waldenstrom's 
macroglobulinemia, multiple myeloma/plasma cell neoplasm, 
myelodysplastic syndrome, and myeloproliferative disorders). 

Also included are brain cancers (e.g., adult brain tumor, childhood 
brain stem glioma, childhood cerebellar astrocytoma, childhood cerebral 

10 astrocytoma, childhood ependymoma, childhood medulloblastoma, 
supratentorial primitive neuroectodermal and pineal, and childhood visual 
pathway and hypothalamic glioma), digestive/gastrointestinal cancers (e.g., 
anal cancer, extrahepatic bile duct cancer, gastrointestinal carcinoid tumor, 
colon cancer, esophageal cancer, gallbladder cancer, adult primary liver 

15 cancer, childhood liver cancer, pancreatic cancer, rectal cancer, small 
intestine cancer, and gastric cancer), musculoskeletal cancers (e.g., 
childhood rhabdomyosarcoma, adult soft tissue sarcoma, childhood soft 
tissue sarcoma, and uterine sarcoma), and endocrine cancers (e.g., 
adrenocortical carcinoma, gastrointestinal carcinoid tumor, islet cell 

20 carcinoma (endocrine pancreas), parathyroid cancer, pheochromocytoma, 
pituitary tumor, and thyroid cancer). 

Further included are neurologic cancers (e.g., neuroblastoma, 
pituitary tumor, and primary central nervous system lymphoma), eye 
cancers (e.g., intraocular melanoma and retinoblastoma), genitourinary 

25 cancers (e.g., bladder cancer, kidney (renal cell) cancer, penile cancer, 
transitional cell renal pelvis and ureter cancer, testicular cancer, urethral 
cancer, Wilms* tumor and other childhood kidney tumors), 
respiratory/thoracic cancers (e.g., non-small cell lung cancer, small cell lung 
cancer, malignant mesothelioma, and malignant thymoma), germ cell 

30 cancers (e.g., childhood extracranial germ cell tumor and extragonadal germ 
cell tumor), skin cancers (e.g., melanoma, and merkel cell carcinoma), 
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gynecologic cancers (e.g., cervical cancer, endometrial cancer, gestational 
trophoblastic tumor, ovarian epithelial cancer, ovarian germ cell tumor, 
ovarian low malignant potential tumor, uterine sarcoma, vaginal cancer, and 
vulvar cancer), and unknown primary cancers. 
5 Specific breast cancers include, but are not limited to, non-invasive 

cancers, such as ductal carcinoma in situ (DCIS), intraductal carcinoma 
lobular carcinoma in situ (LCIS), papillary carcinoma, and 
comedocarcinoma, or invasive cancers, such as adenocarcinomas, or 
carcinomas, e.g., infiltrating ductal carcinoma, infiltrating lobular carcinoma, 

10 infiltrating ductal and lobular carcinoma, medullary carcinoma, mucinous 
(colloid) carcinoma, comedocarcinoma, Paget's Disease, papillary 
carcinoma, tubular carcinoma, and inflammatory carcinoma. Specific 
prostate cancers may include adenocarcinomas and sarcomas, or pre- 
cancerous conditions, such as prostate intraepithelial neoplasia (PIN). 

15 Specific lung cancers include those relating to tumors such as bronchial 
carcinoid (bronchial adenoma), chondromatous hamartoma (benign), 
solitary lymphoma, and sarcoma (malignant) tumors, as well as lung 
cancers relating to multifocal lymphomas. Bronchogenic carcinomas may 
present as squamous cell carcinomas, small cell carcinomas, non-small cell 

20 carcinomas, or adenocarcinomas. 

The IGF-1R antagonist peptides of the invention may be administered 
individually, or in combination with other IGF-1 or IGF-1R antagonists or 
inhibitors. Alternatively, the disclosed IGF-1 R antagonist peptides can be 
used in combination with other cancer therapies, e.g., surgery, radiation, 

25 biological response modification, immunotherapy, hormone therapy, and/or 
chemotherapy. For prostate cancers, non-limiting examples of 
chemotherapeutic agents include docetaxel, paclitaxel, estramustine, 
etoposide, vinblastine, mitoxantrone, and paclitaxel. For breast cancers, 
non-limiting examples of chemotherapeutic and biological agents include 

30 cyclophosphamide, methotrexate, 5-fluorouracil, doxorubicin, tamoxifen, 
paclitaxel, docetaxel, navelbine, capecitabine, mitomycin C, Interferons, 
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interleukin-2, lymphocyte-activated killer cells, tumor necrosis factors, and 
monoclonal antibodies (e.g., mAb to HER-2/neu receptor (trastuzumab) 
Herceptin®). For lung cancers, non-limiting examples of chemotherapeutic 
and biological agents include, but are not limited to, platinum compounds 
5 (e.g., cisplatin or carboplatin), vinca alkaloids (e.g., vinorelbine, vincristine, 
or vinblastine), taxines (e.g., docetaxel or paclitaxel), and various 
topoisomerase inhibitors. 

EXAMPLES 

The examples as set forth herein are meant to exemplify the various 
10 aspects of the present invention and are not intended to limit the invention in 
any way. 

The following materials were used in the examples described below. 
Soluble IGF-1R was obtained from R&D Systems (Minneapolis, MN; Cat. # 
391-GR/CF). Insulin receptor was prepared according to Bass et a/., 1996. 

15 The insulin was either from Sigma (St. Louis, MO; Cat. # I-0259) or 
Boehringer. The IGF-1 was from PeproTech (Cat. # 100-11). All synthetic 
peptides were synthesized by Novo Nordisk, AnaSpec, Inc. (San Jose, CA), 
PeptioGenics (Livermore, CA), or Research Genetics (Huntsville, AL) at 
>80% purity. The Maxisorb Plates were from NUNC via Fisher (Cat. # 

20 12565347). The HRP/Anti-M13 conjugate was from Pharmacia (Cat. # 27- 
9421-01 ). The ABTS solution was from BioF/X (Cat. # ABTS-0 100-04). 

Example 1 : Monomer and Dimer Peptides 
A. Cloning 

Monomer and dimer peptides were constructed and expressed as 
25 protein fusions to a chitin binding domain (CBD) using the pTYB2 vector 
from the IMPACT™-CN system (New England Biolabs (NEB), Beverly, MA). 
The pTYB2 vector encodes a protein-splicing element (termed intein), which 
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initiates self-cleavage upon the addition of DTT. The intein self-cleavage 
separates the dimer from the affinity tag, to allow purification. 

In the pTYB2 construct, the C-terminus of the peptide sequence was 
fused to the N-terminus of the intein/CBD sequence. Two peptide-flanking 
5 epitope tags were included: a shortened-FLAG® at the N-terminus and E- 
Tag at the C-terminus. This fusion was generated by ligating a vector 
fragment encoding the intein/CBD with a PCR product encoding the peptide 
of interest. 

The vector fragment was obtained by digesting at appropriate 

10 restriction sites the pTBY2 vector. The digested DNA fragment was 
resolved on a 1% agarose gel, excised, and purified by QIAEXII (QIAGEN, 
Valencia, CA). To obtain the PCR product of the target proteins, primers 
were synthesized which anneal to appropriate sequences. The vector and 
insert were ligated overnight at 15°C. The ligation product was purified 

15 using QIAquick spin columns (QIAGEN) and electroporations were 
performed at 1500 V in an electroporation cuvette (0.1 mm gap; 0.5 ml 
volume) containing 10 ng of DNA and 40 pi of E. coli strain BL21 . 

Immediately following electroporation, 1 ml of pre-warmed (40°C) 
2xYT medium containing 2% glucose (2xYT-G) was added to the 

20 transformants. The transformants were grown at 37°C for 1 h, and then 
plated onto 2xYT-AG plates and incubated overnight at 37°C. Individual 
colonies were isolated and used to innoculate 2xYT-G. The cultures were 
grown overnight at 37°C. Plasmid DNA was isolated from the cultures and 
sequencing was performed to confirm that the correct construct was 

25 obtained. 

B. Small-scale expression of peptide-CBD fusion proteins 

E .coli ER2566 (New England Biolabs) containing plasmids encoding 
peptide-CBD fusion proteins were grown in 2xYT-AG at 37°C overnight, with 
agitation (250 rpm). The following day, the cultures were used to inoculate 
30 media (2x YT-G) to obtain an OD 600 of 0.1 . Upon reaching an OD 600 of 0.6, 
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expression of the fusion protein was induced by the addition of IPTG 
(isopropyl-p-D-thiogalactopyranoside) to a final concentration of 0.3 mM. 
Cells were grown for 3 h. Following this, cells were pelleted by 
centrifugation and the cell pellets were analyzed by SDS-PAGE 
5 electrophoresis. Production of the correct molecular weight fusion proteins 
was confirmed by Western blot analysis using the monoclonal antibody anti- 
E-Tag-HRP conjugate (Amersham Pharmacia). 

C. Large-scale expression and purification of soluble 
peptide-CBD fusion proteins 

10 E. coli ER2566 carrying plasmids encoding the fusion proteins were 

grown in 2xYT-AG media at 37°C for 8 h, with agitation (250 rpm). The 
cultures were back-diluted into to 2 L volumes of 2xYT-A to achieve an 
OD 6 oo of 0.1. Upon reaching an OD 600 of 0.5, IPTG was added to a final 
concentration of 0.3 mM. Cells were grown at 30°C overnight. The next day 

15 cells were isolated by centrifugation. Samples of the cell pellet were 
analyzed by SDS-PAGE followed by the Western blot analysis using the 
mouse monoclonal antibody anti-E-Tag-HRP conjugate (Pharmacia) to 
visualize the expressed product. 

D. Purification 

20 The cell pellets were disrupted mechanically by sonication or 

chemically by treatment with the mild detergent. After removal of cell debris 
by centrifugation, the soluble proteins in the clarified lysate were prepared 
for chromatographic purification by dilution or dialysis into the appropriate 
starting buffer. The CBD fusions were purified by chitin affinity 

25 chromatography according to the manufacturer's instructions (New England 
Biolabs). The lysate was loaded onto a chitin affinity column and the column 
was washed with 10 volumes of column buffer. Three bed volumes of the 
DTT containing cleavage buffer were loaded onto the column and the 
column was incubated overnight. The next day, the target protein was 
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eluted by continuing the flow of the cleavage buffer without DTT. The 
purified proteins were analyzed for purity and integrity by SDS-PAGE and 
Western blot analysis according to standard protocols. 

Example 2: PEG-Based Dimer Peptides 

5 A. Synthesis of the aldehyde containing peptide 

The peptide was synthesized by stepwise solid phase synthesis on 
Rink amide Tentagel (0.21 mmol/g). Three equivalents of Fmoc-amino 
acids were used. The serine residue was introduced into the peptide by 
either coupling Fmoc-Ser(tBu)-OH to the N-terminal peptide or coupling 

10 Boc-Ser(tBu) to a selectively protected lysine side-chain. The peptide was 
then deprotected and cleaved from the resin by treatment with 95% TFA 
(trifluoroacetic acid; aq) containing TIS (triisopropylsilan). Periodate 
oxidation, using 2 equivalent of Nal0 4 in 20% DMSO (dimethyl sulfoxide)- 
80% phosphate buffer pH 7.5 (45 ul/nmol peptide) for 5 min at room 

15 temperature (RT), converted the 2-amino alcohol moiety in an a-oxoacyl 
group. The peptide was purified immediately following oxidation. 

B. Synthesis of the PEG-based dimer 

The unprotected and oxidized peptide (4.2 equivalent) was dimerized 
on the dioxyamino-PEG (polyethylene glycol)-linker (1 equivalent) in 90% 

20 DMSO-10% 20 mM NaOAc buffer, pH 5.1 (4.2 nl/umol peptide). The 
solution was left for 1 h at 38°C and the progress of the reaction was 
monitored by MALDI-MS (matrix-assisted laser desorption/ionization mass 
spectrometry). Following this, the crude dimer was purified by semi- 
preparative HPLC (high performance liquid chromatography). 

25 The molecular weights and inter peptide distance of various linkers is 

shown in Table 3, below. 

TABLE 3 
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Structure 



Number 



MW 



MW (- 2H 2 0) 



100.1 



64.1 



58.04 



22.04 




149.15 



113.15 




150.14 



114.14 




^ // 




134.13 



98.13 



6 



134.13 



98.13 



\\ / 



4 



134.13 



98.13 




8 



234.25 



198.25 




9 




10 

12" 



302.3 



72.06 
86.09 
114.14 



266.3 



36.06 
50.09 
78.14 
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Gly-NH 2 


15 
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16 
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NH 2 0~°^O~ 0NH ' 


17 
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144.2 


NH > 0-~M-°-~-~ko^° NH ' 
* n = 2 


18 


224.3 


188.3 


n = 3 


19 


268.3 


232.3 


NH ! 0^' 0 ^-0^ ONH ' 
n = 4 


20 


312.4 


276.4 




Z1 


278.4 


242.4 




22 


240.3 


204.3 




23 


240.3 


204.3 




24 


210.2 


192.2 



Example 3: Determination of Insulin Receptor Binding 

IR was incubated with 125 l-labeled insulin at various concentrations of 
5 test substance and the K d was calculated. According to this method, human 
insulin receptor (HIR) or human IGF-1 receptor (HIGF-1R) was purified from 
transfected cells after solubilization with Triton X-100. The assay buffer 
contained 100 mM HEPES (pH 7.8), 100 mM NaCI, 10 mM MgCI 2 , 0.5% 
human serum albumin (HSA), 0.2% gammaglobulin and 0.025% Triton X- 
10 100. The receptor concentration was chosen to give 30-60% binding of 
2000 cpm (3 pM) of its 125 Mabeled ligand (TyrA14- 125 l-HI or Tyr31- 125 l-IGF1) 
and a dilution series of the substance to be tested was added. After 
equilibration for 2 days at 4°C, each sample (200 pi) was precipitated by 
addition of 400 pi 25% PEG 6000, centrifuged, washed with 1 ml 15% PEG 
15 6000, and counted in a gamma-counter. 

The insulin/IGF-1 competition curve was fitted to a one-site binding 
model and the calculated parameters for receptor concentration, insulin 
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affinity, and non-specific binding were used in calculating the binding 
constants of the test substances. Representative curves for insulin 
competition are shown in Figures 10A-10C; 11A-11D. Qualitative data are 
provided in Table 4, below. 
5 Table 4 illustrates IR affinities for the RP9 monomer peptide and 

various RP9 monomer truncations. The results demonstrate that RP9 N- 
terminal sequence (GSLD; SEQ ID NO:1785) and C-terminal sequence 
(LGKK; SEQ ID NO:1786) can be deleted without substantially affecting HIR 
binding affinity (Table 4). 

10 TABLE 4 



Peptide 


SEQ ID 


Formula 


Site 


Sequence 


HIR Kd (mol/l) 




NO: 




IR 




S386 


1559 






GSLDESFYDWFERQLG 


3.2*10-? 


S395 


1787 






GSLDESFYDWFERQL 


9.1*10-8 


S394 


1788 






GSLDESFYDWFERQ 


8.1*10- 8 


S396 


1789 






GSLDESFYDWFER 


>2*10-s 


S399 


1790 






ESFYDWFERQL 


9.1*10-8 


S400 


1791 






ESFYDWFERQ 


6.3*10-' 



Figures 10A-10C demonstrate that Site 1-Site 2 heterodimer peptides 
537, 538, and 539 bound to IR with substantially higher (several orders of 
magnitude) affinity than corresponding monomer (D117 and 540) and 
15 homodimer (521 and 535) peptides. Figures 11A-11D demonstrate that Site 
1-Site 2 heterodimer peptides, 537 and 538, bound to IR with markedly 
higher affinity than the monomer peptide D1 17. 

Example 4: Adipocyte Assay for Determination of Insulin Agonist 
Activity 

20 Insulin increases uptake of 3 H glucose into adipocytes and its 

conversion into lipid. Incorporation of 3 H into the lipid phase was determined 
by partitioning of lipid phase into a scintillant mixture, which excludes water- 
soluble 3 H products. The effect of compounds on the incorporation of 3 H 
glucose at a sub-maximal insulin dose was determined, and the results 
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expressed as increase relative to full insulin response. The method was 
adapted from Moody etal., 1974, Horm. Metab. Res. 6(1): 12-6. 

Mouse epididymal fat pads were dissected out, minced into digestion 
buffer (Krebs-Ringer 25 mM HEPES, 4% HSA, 1.1 mM glucose, 0.4 mg/ml 
5 Collagenase Type 1, pH 7.4), and digested for up to 1.5 h at 36.5°C. After 
filtration, washing (Krebs-Ringer HEPES, 1% HSA), and resuspension in 
assay buffer (Krebs-Ringer HEPES, 1% HSA), free fat cells were pipetted 
into 96-well Picoplates (Packard), containing test solution and approximately 
an ED 2 o insulin. 

10 The assay was started by addition of 3 H glucose (Amersham TRK 

239), in a final concentration of 0.45 mM glucose. The assay was incubated 
for 2 h, 36.5°C, in a Labshaker incubation tower, 400 rpm, then terminated 
by the addition of Permablend/Toluene scintillant (or equivalent), and the 
plates sealed, before standing for at least 1 h and detection in a Packard 

1 5 Top Counter or equivalent. A full insulin standard curve (8 dose) was run as 
control on each plate. 

Data are presented graphically, as effect of compound on an 
(approximate) ED20 insulin response, with data normalized to a full insulin 
response. The assay can also be run at basal or maximal insulin 

20 concentration. Representative dose-response curves for insulin and IGF-1 
are shown in Figures 12-18. Qualitative data are shown in Tables 5-7. 

In free fat cell (FFC) assays, truncated synthetic RP9 monomer 
peptides S390 and S394 showed potency similar to full-length RP9 
monomer peptides (Figures 12A-12D). Truncated synthetic RP9 homodimer 

25 peptides S415 and S417 were highly potent in FFC assays, but less potent 
than full-length RP9 homodimer peptides (Figures 13A-13C; compare to 
peptides 521 and 535, described below). The potency of recombinant RP9 
homodimer peptides 521 and 535 in FFC assays is shown in Figures 14A- 
14C. The curves are flattened, suggesting that the binding mechanism may 

30 not be mediated by simple intramolecular binding (Figures 14A-14C). 
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Results further indicated that synthetic RP9 homodimer peptides 
S337 and S374 showed increased HIR biding affinity and increased potency 
in FFC assays compared to synthetic RP9 monomer. S371 (Table 5). 
Similarly, synthetic RP9 homodimer peptides S3 14 and S3 17 showed 
5 increased HIR binding affinity and increased potency in FFC assays 
compared to synthetic RP9 monomer, S371, and various RP9 truncations 
(Table 6). 



TABLE 5 



Pep. 


SEQ 
ID 

NO: 


Formula 


Site 
IR 


Monomer or 
Dimer 


Sequence 


HIR Kd 
(mol/l) 


FFC 


S371 


1558 


1 


1 


M (RP9) 


GSLDESFYDWFERQLGKK 


6.3.*10- 7 


+ 


S337 


1792 


1-1 


1-1 


D, C-Term 23 


(GSLDESFYDWFERQLGKK-Lig)2-23 


1.1*10-8 


+++++ 


S374 


1793 


1-1 


1-1 


D, N-Term 17 


1 7-(GSLDESFYDWFERQLGKK) 2 


1.8*10- 7 


++++ 



10 M = monomer; D = dimer; C-Term = C-terminal linker (C-C); N-Term = N-terminal linker (N- 
N); 23 and 17 represent specific chemical linkers (see Table 3); For FFC: 0 is no effect, + is 
agonist, - is antagonist. 

TABLE 6 



Peptide 


SEQ 
ID 

NO: 


Form. 


Site 
IR 


Mon. or Dimer 


Sequence 


HIR Kd 
(mol/l) 


FFC 


S371 
(RP9) 


1558 


1 




M 


GSLDESFYDWFERQLGKK 


6.3.*10" 7 


+ 


S395 


1787 


1 




M 


GSLDESFYDWFERQL 


9.1*10*8 


+ 


S394 


1788 


1 




M 


GSLDESFYDWFERQ 


8.1*10-8 


++ 


S396 


1789 


1 




M 


GSLDESFYDWFER 


>2*10-s 


0 


S390 


1794 


1 




M 


ESFYDWFERQLG 


6.2M0- 7 


+ 


S399 


1790 


1 




M 


ESFYDWFERQL 


9.1*10-8 


++ 


S400 


1791 


1 




M 


ESFYDWFERQ 


6.3*1 0- 7 


0 


S415 


1795 


1-1 


1-1 


D; C-Term 


(ESFYDWFERQLGK)2-23 


1.0*10- 7 


++++ 


S417 


1796 


1-1 


1-1 


D; N-Term 


23-(ESFYDWFERQLG) 2 


9.2*1 0- 7 


+++ 



15 M = monomer; D = dimer; C-Term = C-terminal linker (C-C); N-Term = N-terminal linker (N- 
N); 23 represents a specific chemical linker (see Table 3); For FFC: 0 is no effect, + is 
agonist, - is antagonist; Form. = formula; Mon. = monomer; 

Site 1-Site 2 dimer peptides 537 and 538 were inactive in the FFC 
20 assays using the standard concentration of insulin (Figures 15A-15C). 
However, Site 1-Site 2 dimer peptides 537 and 538 were antagonists in the 
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FFC assay in the presence of a stimulating concentration of insulin (Figures 
16A-16C). In contrast, Site 2-Site 1 dimer peptide 539 was a full agonist in 
the FFC assay, with a slope similar to that of insulin (Figures 17A-17B). 

Additional experiments confirmed that FFC assay activity of Site 1- 
5 Site 2 dimer peptides was affected by the orientation of the monomer 
subunits (Figures 18A-18D). In particular, dimer peptides comprising Site 1 
(S372 or S373) and Site 2 (S451 or S452) monomer subunits exhibited 
antagonist activity in the Site 1-Site 2 orientation (C-N linkage) (dimer 
peptide S453); moderate levels of agonist activity in the Site 1-Site 2 

10 orientation (N-N or C-C linkage) (dimer peptides S454 and S456); and high 
levels of agonist activity in the Site 2-Site 1 orientation (C-N linkage) (dimer 
peptide S455) (Figures 18A-18D). 

Table 7, below, shows the HIR binding affinity and FFC assay 
potency of various synthetic peptides, including Site 1-Site 1 dimer peptides 

15 S325, S329, S332; S333, S334, S335, S336, S337, S349, S350, S351, 
S352, S353, S354, S361, S362, S363, S374, S375, S376, S378, S379, 
S380, S381, S414, S415, S416, S417, S418, S420, and S424. These 
synthetic dimer peptides exhibited properties comparable to dimer peptides 
521 and 535, regardless of the orientation of the monomer subunits. In 

20 particular, synthetic Site 1-Site 2 dimer peptides S425, S453, and S459 
exhibited antagonist properties comparable to those of the Site 1-Site 2 
dimer peptides 537 and 538. Synthetic Site 1-Site 2 dimer peptides S455, 
S457, and S458 exhibited agonist properties comparable to the dimer 
peptide 539. Synthetic Site 1-Site 2 dimer peptides S436, S437, S438, 

25 S454, S456 act as partial agonists in the FFC assay (i.e., the peptides 
exhibit a maximal response of less than 100% that of insulin), which is 
shown in the table as "++" and "+++". 

Table 7 also shows properties of truncated monomer and dimer 
peptides, and thereby indicates which N- or C-terminal residues can be 

30 deleted without substantial loss of HIR binding affinity (e.g., see synthetic 
peptides S386 through S392, S394 through S403, and S436 through S445). 
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Notably, certain Site 2-Site 1 dimers show IR affinities of 2*10" 11 (see, e.g., 
S519 and S520). These peptides are also very potent in the fat cell assay 
(Figures 31A-31B) and even more potent in the HIR kinase assay (Figures 
32A-32B) (kinase assay described below). 
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Results further indicated that S175-S175 dimer peptides (Site 1-Site 
1) were less agonistic than S175 monomer peptides (++ vs. +++). S175- 
S175 dimer peptides having a C-N linkage were less agonistic or equally 
agonistic as compared to S175-S175 dimer peptides having C-C or N-N 
5 linkages. F8-F8 dimer peptides, like the parent monomer, showed no 
agonist activity. 

Example 5: Substrate Phosphorylation Assay (HIR Kinase) 

WGA (wheat germ agglutinin)-purified recombinant human insulin 
receptor was mixed with either insulin or peptide in varying concentrations in 

10 substrate phosphorylation buffer (50 mM HEPES (pH 8.0), 3 mM MnCI 2 , 10 
mM MgCI 2 , 0.05% Triton X-100, 0.1% BSA, 12.5 jaM ATP). A synthetic 
biotinylated substrate peptide (Biotin-KSRGDYMTMQIG) was added to a 
final concentration of 2 |ig/ml. Following a 1 h incubation at RT, the 
reactions were stopped by the addition of 50 mM EDTA. The reactions were 

15 transferred to Streptavidin coated 96-well microtiter plates (NUNC, Cat. No. 
236001) and incubated for 1 h at RT. The plates were washed 3 times with 
TBS (10 mM Tris (pH 8.0), 150 mM NaCI). 

Subsequently, a 2000-fold dilution of horseradish peroxidase (HRPO) 
conjugated phosphotyrosine antibody (Transduction Laboratories, Cat. No. 

20 E120H) in TBS was added. The plates were incubated for 30 min and 
washed 3 times with TBS. TMB (3,3\5,5'-tetramethylbenzidine; Kem-En- 
Tec, Copenhagen, Denmark) was added. One substrate from Kem-En-Tec 
was added. After 10-15 min, the reaction was stopped by the addition of 1% 
acetic acid. The absorbance, representing the extent of substrate 

25 phosphorylation, was measured in a spectrophotometer at a wavelength of 
450 nM. 

The results indicated that the potency of the Site 1-Site 2 dimer, 
peptide 539, was 0.1 to 1% of that of insulin in all assays tested (Table 8), 
and the dose-response curves (Figures 17A-17B) had a shape similar to 
30 that of insulin dose-response curves, suggesting an insulin-like action 
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mechanism. In addition, Site 1-Site 2 dimer peptides 537 and 538 were also 
active as specific insulin receptor antagonists (Table 8; Figures 16A-16C). 
Notably, Site 2-Site 1 dimer peptide 539 was more active in the kinase 
assay than Site 1-Site 1 homodimer peptides 521 and 535 (Figures 19A- 
5 19B), despite lower FFC potency (Figures 14A-14C; Figures 17A-17B). 
Similar results are shown in Figures 20A-B and Figures 21A-B. This data 
suggested that homodimer and heterodimer peptides used different 
mechanisms of action. 

TABLE 8 



10 



Pep. 


Mon./ 
Link. 


Sequence 


SEQ ID 
NO: 


Form 


Site 
IR 


HIR 
Kd 

(nM) 


HIGF- 
1R Kd 
(nM) 


FFC 
Pot. 
(nM) 


Kinase 

Pot. 

(nM) 


HI 








na 


na 










HIGF 
-1R 








na 


na 










521 


RP9- 
6aa- 
RP9 


MADYKDDDDKGSLDESFYDWFE 
RQLGKKGGSGGSGSLDESFYDW 
FERQLGKKAAA(ETAG)PG 


2112 


1-1 


1-1 


25 




A 3 


1400 


535 


RP9- 
12aa 
-RP9 


MADYKDDDDKGSLDESFYDWFE 
RQLGKKGGSGGSGGSGGSGSL 
DESFYDWFERQLGKKAAA(ETAG 
)PG 


2113 


1-1 


1-1 


15 




A 2 


1000 


537 


RP9- 
6aa- 
D8 


MADYKDDDDKGSLDESFYDWFE 
RQLGKKGGSGGSWLDQEWAWV 
QCEVYGRGCPSAAA(ETAG)PG 


2114 


1-6 


1-2 


0.092 


980 


N 10 


Inactive 


538 


RP9- 
12aa 
-D8 


MADYKDDDDKGSLDESFYDWFE 
RQLGKKGGSGGSGGSGGSWLD 
QEWAWVQCEVYGRGCPSAAA(E 
TAG)PG 


2115 


1-6 


1-2 


0.080 


710 


N 10 


Inactive 


539 


D8- 

6aa- 

RP9 


MADYKDDDDKWLDQEWAWVQC 
EVYGRGCPSGGSGGSGSLDESF 
YDWFERQLGKKAAA(ETAG)PG 


2116 


6-1 


2-1 


0.530 


1500 


A 10 


110 



A = agonist; N = antagonist; na = not applicable; Form. = formula; Mon. = constituent 
monomers; Link. = linker; Pot. = potency; HI and HIGF-1R are controls; All with tags at both 
ends; All dimers are linked C-N; Linker sequences are underlined. 



15 Example 6: IR Autophosphorylation Assays 

IR activation was assayed by detecting autophosphorylation of an 
insulin receptor construct transfected into 32D cells (Wang et a/., 1993, 
Science 261:1591-1594; clone 969). The IR transfected 32D cells were 
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seeded at 5 x 10 6 cells/well in 96-well tissue culture plates and incubated 
overnight at 37°C. Samples were diluted 1:10 in the stimulation medium 
(PRIM1640 with 25 nM HEPES pH 7.2) plus or minus insulin. The culture 
media was decanted from the cell culture plates, and the diluted samples 
5 were added to the cells. The plates were incubated at 37°C for 30 min. The 
stimulation medium was decanted from the plates, and cell lysis buffer (50 
mM HEPES pH 7.2, 150 mM NaCI, 0.5% Triton X-100, 1 mM AEBSF, 10 
KlU/ml aprotinin, 50 uM leupeptin, and 2 mM sodium orthovanadate) was 
added. The cells were lysed for 30 min. 

10 In the ELISA portion of the assay, the cell lysates were added to the 

BSA-blocked anti-IR unit mAb (Upstate Biotechnology, Lake Placid, NY) 
coated ELISA plates. After a 2 h incubation, the plates were washed 6 
times with PBST and biotinylated anti-phosphotyrosine antibody (Upstate 
Biotechnology) is added. After another 2 h incubation, the plates were 

15 again washed 6 times. Streptavidin-Eu was then added, and the plates 
were incubated for 1 h. After washing the plates again, EG&G Wallac 
enhancement solution (100 mM acetone-potassium hydrogen pthalate, pH 
3.2; 15 mM 2-naphtyltrifluoroacetate; 50 mM tri(n-octyl)-phosphine oxide; 
0.1% Triton X-100) was added into each well, and the plates were placed 

20 onto a shaker for 20 min at RT. Fluorescence of samples in each well was 
measured at 615 nm using a VICTOR 1420 Multilabel Counter (EG&G 
Wallac). 

Alternatively, IR autophosphorylation was determined using a 
holoenzyme phosphorylation assay. In accordance with this assay, 1 pi of 

25 purified insulin receptor (isolated from a Wheat Germ Agglutinin Expression 
System) was incubated with 25 nM insulin, or 10 or 50 uM peptide in 50 pi 
autophosphorylation buffer (50 mM HEPES pH. 8.0, 150 mM NaCI, 0.025% 
Triton-X-100, 5 mM MnCI 2 , 50 pM sodium orthovanadate) containing 10 pM 
ATP for 45 min at 22°C. The reaction was stopped by adding 50 pi of gel 

30 loading buffer containing p-mercaptoethanol (Bio-Rad Laboratories, Inc., 
Hercules, CA). The samples were run on 4-12% SDS-polyacrylamide gels. 
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Western Blot analysis was performed by transferring the proteins onto 
nitrocellulose membrane. The membrane was blocked in PBS containing 
3% milk overnight. The membrane was incubated with anti-phosphotyrosine 
4G10 HRP labeled antibody (Upstate Biotechnology) for 2 h. Protein bands 
5 were visualized using SuperSignal West Dura Extended Duration Substrate 
Chemiluminescence Detection System (Pierce Chemical Co.). 

Example 7: Fluorescence-Based HIR Binding Assays 

A. Time-Resolved Fluorescence Resonance Energy Transfer 
Assays 

10 Time-resolved fluorescence resonance energy transfer assays (TR- 

FRET) were used for peptide competition studies. In one set of assays, 
monomer and dimer peptides were tested for the ability to compete with 
biotinylated RP9 monomer peptide (b-RP9) for binding to HIR- 
immunoglobulin heavy chain chimera (sIR-Fc; Bass et a/., 1996). The 

15 assays were performed using a 384-well white microplate (NUNC) with a 
final volume of 30 jil. Final incubation conditions were in 22 nM b-RP9, 1 
nM SA-APC (streptavidin-allophycocyanin), 1 nM Eu 3+ -slR-Fc (LANCE™ 
labeled, PE Wallac, Inc.), 0.05 M Tris-HCI (pH 8 at 25°C), 0.138 M NaCI, 
0.0027 M KCI, and 0.1% BSA (Cohn Fraction V). After 16-24 h of incubation 

20 at RT, the fluorescence signal at 665 nm and 620 nm was read on a Victor 2 
1420 plate reader (PE Wallac, Inc.). Primary data were background 
corrected, normalized to buffer controls, and then expressed as percent of 
specific binding. 

Results are shown in Figures 22A-22B. Figure 21 A shows b-RP9 
25 competition data. For these figures, the Z'-factor was greater than 0.5 (Z' = 
1-(3ct + +3ct.)/|h + -u-|; Zhang et a/., 1999, J. Biomol. Screen. 4:67-73), and the 
signal-to-background (S/B) ratio was -4-5. In Figure 22A, each data point 
represents the average of two replicate wells. The lines represent the best 
fit to a four-parameter non-linear regression analysis of the data according 



WO 03/027246 



PCT/US02/30412 



-124- 

to the following formula: y = min + (max-mln)/(1+10 A ((loglC 5 o-x)*Hillslope)). 
This was used to determine IC50 values. 

In another set of assays, monomer and dimer peptides were tested 
for the ability to compete with biotinylated-S175 (b-S175) or b-RP9 for 
5 binding to sIR-Fc. The TR-FRET assays were performed in a 384-well white 
microplate with a final volume of 30 Final incubation conditions were in 
33 nM b-S175 or 22 nM b-RP9, 1 nM SA-APC, 1 nM Eu 3+ -slR-Fc, 0.05 M 
Tris-HCI (pH 8 at 25°C), 0.138 M NaCI, 0.0027 M KCI, and 0.1% BSA. After 
16-24 h of incubation at RT, the fluorescence signal at 665 nm and 620 nm 
10 was read on a Victor 2 1420 plate reader. Primary data were background 
corrected, normalized to buffer controls, and then expressed as % specific 
binding. 

Results are shown in Figures 23A-23B. For these figures, each data 
point represents the average of two replicate wells. The lines represent the 
15 best fit to a four-parameter non-linear regression analysis of the data, which 
was used to determine IC50 values. Figure 23A shows b-S175 competition 
data; Figure 23B shows b-RP9 competition data. 

B. Fluorescence Polarization Assays 

Fluorescence polarization assays (FP) were used for peptide 
20 competition studies. In one set of assays monomer and dimer peptides 
were tested for the ability to compete with fluorescein-RP9 (FITC-RP9) for 
binding to soluble HIR ectodomain (sIR; Kristensen et ai, 1998, J. Biol. 
Chem. 273:17780-17786). The assays were performed in a 384-well black 
microplate (NUNC) with a final volume of 30 ul. Final incubation conditions 
25 were 1 nM FITC-RP9, 10 nM sIR, 0.05 M Tris-HCI (pH 8 at 25°C), 0.138 M 
NaCI, 0.0027 M KCI, 0.05% BGG (bovine gamma globulin), 0.005% Tween- 
20®. After 16-24 h of incubation at RT, the fluorescence signal at 520 nm 
was read on an Analyst™ AD plate reader (LJL BioSystems, Inc.). Primary 
data were background corrected using 10 nM sIR without FITC-RP9 
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addition, normalized to buffer controls, and then expressed as percent of 
specific binding. The Z'-factor was greater than 0.5 and the assay dynamic 
range was -125 mP. In Figures 24-27, each data point represents the 
average of two replicate wells. The lines represent the best fit to a four- 
5 parameter non-linear regression analysis of the data, which was used to 
determine IC 50 values. The Z'-factor was greater than 0.5 and the assay 
dynamic range was -125 mP. Results are shown in Figures 24A-24B. 

In another set of assays, monomer and dimer peptides were tested 
for the ability to compete with FITC-RP9 for binding to soluble human insulin 

10 mini-receptor (mIR; Kristensen et a/., 1999, J. Biol. Chem. 274:37351- 
37356). The FP assays were performed in a 384-well black microplate with 
a final volume of 30 Final incubation conditions were 2 nM FITC-RP9, 20 
nM mIR, 0.05 M Tris-HCI (pH 8 at 25°C), 0.138 M NaCI, 0.0027 M KCI, 
0.001% BGG, 0.005% Tween-20®. After 16-24 h of incubation at RT, the 

15 fluorescence signal at 520 nm was read on an Analyst™ AD plate reader. 
Primary data were background corrected using 20 nM mIR without FITC- 
RP9 addition, normalized to buffer controls and then expressed as percent 
of specific binding. Results are shown in Figures 25A-25B. 

Monomers and dimer peptides were also tested for the ability to 

20 compete with fluorescein-insulin (FITC-lnsulin) for binding to sIR. The FP 
assays were performed in a 384-well black microplate with a final volume of 
30 jil. Final incubation conditions were in 2 nM FITC-lnsulin, 20 nM sIR, 
0.05 M Tris-HCI (pH 8 at 25°C), 0.138 M NaCI, 0.0027 M KCI, 0.05% BGG, 
0.005% Tween-20®. After 16-24 h of incubation at RT, the fluorescence 

25 signal at 520 nm was read on an Analyst™ AD plate reader. Primary data 
were background corrected using 20 nM sIR without FITC-lnsulin addition, 
normalized to buffer controls and then expressed as percent of specific 
binding. Results are shown in Figures 26A-26B. 

In other assays, peptide monomers and dimer peptides were tested 

30 for the ability to compete with FITC-lnsulin for binding to mIR. The FP 
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assays were performed in a 384-well black microplate with a final volume of 
30 pi. Final incubation conditions were 2 nM FITC-lnsulin, 20 nM mIR, 0.05 
M Tris-HCI (pH 8 at 25°C), 0.138 M NaCI, 0.0027 M KCI, 0.05% BGG 
(bovine gamma globulin), 0.005% Tween-20®. After 16-24 h of incubation at 
5 RT, the fluorescence signal at 520 nm was read on an Analyst™ AD plate 
reader. Primary data were background corrected using 20 nM mIR without 
FITC-RP9 addition, normalized to buffer controls and then expressed as % 
specific binding. Results are shown in Figures 27A-27B. 

C. Summary 

10 Table 9, below, summarizes the binding data calculated from 

competition assays using the IR constructs, sIR-Fc, sIR, and mIR, in TR- 
FRET and FP formats. The data in Table 9 indicate that most dimer 
peptides (e.g., S291 and S375 or S337), showed greater agonist activity 
than the corresponding monomer peptides (e.g., H2C or RP9, respectively) 

15 in the FFC assay. It was previously demonstrated that an inequality 
between monomer peptides and insulin was exhibited in competition assays 
where the assay reporter was a monomer peptide (i.e., RP9 or S175). This 
inequality was also demonstrated by dimer peptides as seen in Table 9. 
Table 9 further shows that Group 6 monomer peptides such as E8 (D120) 

20 were able to compete with FITC-RP9 or b-RP9 peptides for binding to sIR- 
Fc, but did not compete peptide ligands, such as FITC-RP9 for binding to 
mIR. Experiments using different IR constructs thereby allowed 
differentiation of Site I peptides based on sequence motifs (i.e., Group 6 
(Formula 10) vs. Group 1 (Formula 1; A6)). 
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z 
z 
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C-C 






C-C 




C-C 












SEQ ID 

NO: 




2117 


1916 
and 
1917 


1558 


1994 


1960 
and 
1961 


2008 


1794 


2015 
and 
2016 


1560 


2001 
and 
2002 


2118 










Monomer 
or Dimer 




| H2C 


S291 


| RP9 


I S375 


S337 


S391 


I S390 


S414 


I S175 


S380 


© 

CM 

a 

CO 
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Based on the functional studies outlined above, the following peptide 
dimers were designed. 



SEQ 

ID 

NO: 


Monom./ 
Linkers 


Sequence 


2119 


F8-6aa- 
RP9 


HLCVLEELFWGASLFGYCSGGGSGGSGSLDESFYDWFERQL 


2120 


F8-12aa- 
RP9 


HLCVLEELFWGASLFGYCSGGGSGGSGGSGGSGSLDESFYDWFERQL 


2121 


D8-6aa- 
S175 


WLDQEWAWVQCEVYGRGCPSGGSGGSGRVDWLQRNANFYDWFVAELG 


2122 


D8-12aa- 
S175 


WLDQEWAWVQCEVYGRGCPSGGSGGSGGSGGSGRVDWLQRNANFYDWFVAELG 


2123 


F8-6aa- 
S175 


HLCVLEELFWGASLFGYCSGGGSGGSGRVDWLQRNANFYDWFVAELG 


2124 


F8-12aa- 
S175 


HLCVLEELFWGASLFGYCSGGGSGGSGGSGGSGRVDWLQRNANFYDWFVAELG 


2125 


D8-6aa- 
RP15 


HLCVLEELFWGASLFGYCSGGGSGGSSQAGSAFYAWFDQVLRTV 


2126 


D8-6aa- 
RP6 


HLCVLEELFWGASLFGYCSGGGSGGSTFYSCLASLLTGTPQPNRGPWERCR 


2127 


D8-6aa- 
RP17 


HLCVLEELFWGASLFGYCSGGGSGGSQSDAFYSGLWALIGLSDG 


2128 


D8-6aa- 
Grp6 


HLCVLEELFWGASLFGYCSGGGSGGSDSDWAGYEWFEEQLD 



5 Linker sequences are underlined and in bold; Monomer sequences are shown below; All 
dimers are linked C-N. 



SEQ ID NO: 


Monomer 


Formula 


Site 


Sequence 


1576 


F8 


4 


2 


HLCVLEELFWGASLFGYCSG 


1558 


RP9 


1 


1 


GSLDESFYDWFERQL 


2129 


D8 


6 


2 


WLDQEWAWVQCEVYGRGCPS 


1560 


S175 


1 




GRVDWLQRNANFYDWFVAELG 


2130 


RP15 


1 




SQAGSAFYAWFDQVLRTV 


1635 


Rp6 


2 




TFYSCLASLLTGTPQPNRGPWERCR 


2131 


RP17 


1 




QSDAFYSGLWALIGLSDG 


1595 


Group 6 


10 




DSDWAGYEWFEEQLD 



Example 8: Peptide Fusions To The Maltose Binding Protein 

10 A. Cloning 

The transfer of interesting peptide sequences from phage display to 
maltose binding protein (MBP) fusions is desirable for several reasons. 
First, to obtain a more sensitive affinity estimate, the polyvalency of phage 
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display peptides should be converted to a monovalent system. For this 
purpose, the peptide sequences are fused to MBP that generally exists as a 
monomer with no cysteine residues. Second, competition experiments can 
be carried out with the same or different peptides, one phage displayed and 
5 the other fused to MBP. Lastly, purified peptides can be obtained by 
cleavage of the fusion protein at a site engineered in the DNA sequence. 

Figure 28 shows a schematic drawing of the MBP-peptide construct. 
In the construct, the N-terminus of the peptide sequence is fused to the C- 
terminus of the MBP. Two peptide-flanking epitope tags are included, a 

10 shortened-FLAG® at the N-terminus and E-Tag at the C-terminus. The 
corresponding gene fusion was generated by ligating a vector fragment 
encoding the MBP in frame with a PGR product encoding the peptide of 
interest. The vector fragment was obtained by digesting the plasmid pMAL- 
c2 (New England Biolabs) with EcoRI and Hind\\\ and then treating the 

15 fragment with shrimp alkaline phosphatase (SAP; Amersham). The 
digested DNA fragment was resolved on a 1% agarose gel, excised, and 
purified by QIAEXII (QIAGEN). The 20-amino acid peptide sequences of 
interest were initially encoded in the phage display vector pCANTABSE 
(Pharmacia). To obtain these sequences, primers were synthesized which 

20 anneal to sequences encoding the shortened FLAG® or E-Tag epitopes and 
also contain the required restriction enzyme sites EcoRI and HindUl. PCR 
products were obtained from individual phage clones and digested with 
restriction enzymes to yield the insert fragment. The vector and insert were 
ligated overnight at 15°C. The ligation product was purified using QIAquick 

25 spin columns (QIAGEN) and electroporations were performed at 1500 v in 
an electroporation cuvette (0.1 mm gap; 0.5 ml volume) containing 10 ng of 
DNA and 40 pi of E. coli strain ER2508 (RR1 lon:miniTn10(Tet r ) (malB) 
(argF-lac)U169 Pro + zjc: :Tn5(Kan r ) fhuA2) electrocompetent cells (New 
England Biolabs). Immediately after the pulse, 1 ml of pre-warmed (40°C) 

30 2xYT medium containing 2% glucose (2xYT-G) was added and the 
transformants were grown at 37°C for 1 h. Cell transformants were plated 
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onto 2xYT-AG plates and grown overnight at 37°C. Sequencing confirmed 
the clones contained the correct constructs. 

B. Small-Scale Expression of Soluble MBP-Peptide Fusion 
Proteins 

5 E. coli ER2508 (New England Biolabs) carrying the plasmids 

encoding MBP-peptide fusion proteins were grown in 2xYT-AG at 37°C 
overnight (250 rpm). The following day the cultures were used to inoculate 
media (2x YT containing-G) to achieve an OD 60 o of 0.1. When the cultures 
reached an OD 6 oo of 0.6, expression was induced by the addition of IPTG to 
10 a final concentration of 0.3 mM and then cells were grown for 3 h. The cells 
were pelleted by centrifugation and samples from total cells were analyzed 
by SDS-PAGE electrophoresis. The production of the correct molecular 
weight fusion proteins was confirmed by Western blot analysis using the 
monoclonal antibody anti-E-Tag-HRP conjugate (Pharmacia). 

1 5 C. Large-Scale Expression of Soluble MBP-Peptide Fusion 

Proteins 

E. coli ER2508 carrying plasmids encoding the MBP-peptide fusion 
proteins were grown in 2xYT-AG media for 8 h (250 rpm, 37°C). The 
cultures were subcultured in 2xYT-AG to achieve an OD 6 oo of 0.1 and grown 

20 at 30°C overnight. This culture was used to inoculate a fermentor with 
medium of following composition (g/l): glucose (3.00); (NH 4 ) 2 S0 4 5.00; 
MgS0 4 • 7H 2 0 (0.25); KH 2 P0 4 (3.00); citric acid (3.00); peptone (10.00); 
and yeast extract (5.00); pH 6.8. 

The culture was grown at 700 rpm, 37°C until the glucose from the 

25 medium was consumed (OD 6 oo = ~6.0 - 7.0). Expression of the fusion 
protein was induced by the addition of 0.3 mM IPTG and the culture was 
grown for 2 h in fed-batch mode fermentation with feeding by 50% glucose 
at a constant rate of 2 g/l/h. The cells were removed from the medium by 
centrifugation. Samples of the cell pellet were analyzed by SDS-PAGE 



WO 03/027246 



PCT/US02/30412 



-131- 

followed by the Western blot analysis using the mouse monoclonal antibody 
anti-E-Tag-HRP conjugate (Pharmacia) to visualize the expressed product. 

D. Purification 

The cell pellets were disrupted mechanically by sonication or 
5 chemically by treatment with the mild detergent Triton X-100. After removal 
of cell debris by centrifugation, the soluble proteins were prepared for 
chromatographic purification by dilution or dialysis into the appropriate 
starting buffer. The MBP fusions were initially purified either by amylose 
affinity chromatography or by anion exchange chromatography. Final 

10 purification was performed using anti-E-Tag antibody affinity columns 
(Pharmacia). The affinity resin was equilibrated in TBS (0.025 M Tris- 
buffered saline, pH 7.4) and the bound protein was eluted with Elution buffer 
(100 mM glycine, pH 3.0). The purified proteins were analyzed for purity 
and integrity by SDS-PAGE and Western blot analysis according to standard 

15 protocols. 

For MBP fusions, IR agonist activity was observed for the Site 1-Site 
1 dimer peptides shown in Table 10, below. Additional binding data for the 
MBP fusions are shown in Table 1 1 , also below. 
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E. BIAcore Analysis 

For BIAcore analysis of fusion protein and synthetic peptide binding to 
insulin receptor, insulin (50 ^g/ml in 10 mM sodium acetate buffer pH 5) was 
immobilized on the CMS sensor chip (Flowcell-2) by amine coupling. 
5 Flowcell-1 was used for background binding to correct for any non-specific 
binding. Insulin receptor (450 nM) was injected into the flow cell and the 
binding of IR to insulin was measured in resonance units (RUs). Receptor 
bound to insulin gave a reading of 220 RU. The surface was regenerated 
with 25 mM NaOH. Pre-incubation of receptor with insulin in a tube at RT 

10 completely abrogated the response units to 16 RU. Thus, the system was 
validated for competition studies. Several maltose-binding fusion proteins, 
peptides, and rVabs were pre-incubated with insulin receptor before injecting 
over the insulin chip for competition studies. The decrease in 
binding/resonance units indicates that several MBP-fusion proteins can block 

15 the insulin-binding site. The results are shown in Tables 12 and 13. The 
amino acid sequences referred to in the tables are identified in Figures 8 and 
9A-9B, except the 447 and 2A9 sequences, which are shown below. 

TABLE 12 



BIAcore Results — Fusion Proteins Compete for Binding to IR 





Incubation Mixtures 


Result (RUs) 


Sequence Type 


Controls 


Insulin Receptor (IR) 450 nM 


220 


Positive Control 




Insulin ( 8.7 pM) 


16 


Negative Control 


MBP Fus. Prots. 


A7 (20A4)-MBP (4.1 ljM) + IR 


43 


Formula 6 Motif 




D8-MBP(1.6 gM) + IR 


56 


Formula 6 Motif 




D10-MBP (3.4 uM)+ IR 


81 


Formula 1 1 Motif 




447-MBP (11.5gM) + IR 


195 


hGH Pept. Fus. 




MBP (13 uM) + IR 


209 


Negative Control 



20 

The A7 (20A4), D8, and D10 peptide sequence are shown in Figures 8 and 9A-9B. The 447 
peptide sequence is: LCQRLGVGWPGWLSGWCA (SEQ ID NO:2156). 

TABLE 13 



25 



BIAcore Results - Synthetic peptides compete for binding to IR 
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Incubation Mix 


% Binding 


Result (RUs) 


Sequence Type 


IR 


100 


128 


Positive control 


IR + 20D1 


41 


51.8 


Formula 1 Motif 


IR +D8 


33 


41.6 


Formula 6 Motif 


IR + 20C11 


38 


49 


Formula 2 Motif (bkq hiqh) 


IR + H2 


27 


34.6 


IGF (phosphorylated band) 


IR + 2A9 


100 


128 


IGF(bkg high) 


IR + 20A4 


33 


41.8 


Formula 6 Motif 


IR + p53wt 


97 


124.5 


P53 wild type 



The concentration of each peptide was about 40 pM and the concentration of IR 
was 450 nM. The 20D1, 20A4, and D8 peptide sequences are shown in Figures 8 and 9A- 
9B. The remaining peptide sequences are as follows: 447 = LCQRLGVGWPGWLSGWCA 
5 (SEQ ID NO:2156); 2A9 = LCQSWGVRIGWLTGLCP (SEQ ID NO:2157); 20C11 = 
DRAFYNGLRDLVGAVYGAWD (SEQ ID NO: 1659); H2 = VTFTSAVFHENFYDWFVRQVS 
(SEQ ID NOM784). 



Regarding preparation of a Site 1 agonist comprising two D1 17 (H2C) 
10 peptides, a linker of only 3 amino acids (12 A) provided a ligand of greater 
affinity for Site 1 of IR than a corresponding ligand prepared with a 9 amino 
acid (36 A) linking region (Figure 29). 

F. Stimulation of Autophosphorylation of IR by MBP-Fusion 
Peptides 

15 MBP fusion peptides were prepared as described above, and then 

assayed for autophosphorylation of a insulin receptor construct transfected 
into 32D cells (Wang et a/., 1993; clone 969) (see Example, above). The 
results of these experiments shown in Figure 30 indicate that the H2C 
monomer and H2C-H2C homodimer peptides stimulate autophosphorylation 

20 of IR in vivo. H2C dimer peptides (Site 1-Site 1) with a 6 amino acid linker 
(H2C-6aa-H2C) were most active in the autophosphorylation assay. Other 
active dimer peptides are also shown in Figure 30, particularly H2C-9aa- 
H2C, H2C-12aa-H2C, H2C-3aa-H2C, and F8. 



G. Insulin Receptor Binding Affinity and Fat Cell Potency of 
25 MBP-Fusion Peptides 

Results of assays to determine binding affinity for insulin receptor and 
fat cell potency of the MBP-fusion peptides are shown in Table 14, below. 
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CO 
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Sequence 


MBP...NNNNLGIEGRISEFIEGR AQPAMA WLDQEWAWVQCEVYGRGCPS AAA(ETAG)AA 


MBP...NNNNLGIEGRISEFIEGRAQPAMAWLDQEWAWVQCEVYGRGCPSGGSGGSKWLDQEWAWVQCEVYGRGCPSA 


MBP...NNNNLGIEGRISEFIEGRDYKDDDOKHLCVLEELFWGASLFGYCSGAAA(ETAG)AA 


MBP...NNNNLGIEGRISEFIEGRDYKDDDDK HLCVLEELFWGASLFGYCSGGGSGGS HLCVLEELFWGASLFGYCSGAAA(ETAG)AA 


MBP...NNNNLGIEGRISEFIEGRDYKDDDDKHLCVLEELFWGASLFGYCSGGGSGGSGGSGGS 
HLCVLEELFWGASLFGYCSGAAA(ETAG)AA 


MBP...NNNNLGIEGRISEFIEGRDYKDDDDK HLCVLEELFWGASLFGYCSGGGSGGS HLCVLEELFWGASLFGYCSGGGSGGS 
HLCVLEELFWGASLFGYCSGAAA(ETAG)AA | 


MBP...NNNNLGIEGRISEFIEGRAQPAMA FHENFYDWFVRQVSAAA(ETAG)AA 
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Example 9: In Vivo Assays for Insulin Agonists 

To test the in vivo activity of dimer peptide S519, an intravenous 
blood glucose test was carried out on Wistar rats. Male Mol:Wistar rats, 
weighing about 300 g, were divided into two groups. A 10 pi sample of 
5 blood was taken from the tail vein for determination of blood glucose 
concentration. The rats were anaesthetized with Hypnorm/Dormicum at t = - 
30 min and blood glucose was measured again at t = -20 min and at t = 0 
min. After the t = 0 sample was taken, the rats were injected into the tail 
vein with vehicle or test substance in an isotonic aqueous buffer at a 
10 concentration corresponding to a 1 ml/kg volume of injection. Blood glucose 
was measured at times 10, 20, 30, 40, 60, 80, 120, and 180 min. The 
Hypnorm/Dormicum administration was repeated at 20 min intervals. 
Results shown in Figure 33 demonstrate that the S519 (at 20 nmol/kg) 
peptide lowered blood glucose levels similar to levels observed for human 
15 insulin (at 2.5 nmol/kg) (n=8). The S519 peptide and human insulin showed 
comparable in vivo effects, both in magnitude and onset of response (Figure 
33). 

Example 10: IGF-1 Binding Peptides 

Three major groups of peptide IGF-1 -binding peptides were obtained from 
IGF-1 R panning experiments: Site 1 A6 (FyxWF) (SEQ ID NO:1596); Site 1 
B6 (FyxxLxxL) (SEQ ID NO:1732), and Site 2 (cysteine loops). See Beasley 
et al. International Application PCT/US00/08528, filed March 29, 2000, and 
Beasley et al., U.S. Application Serial No. 09/538,038, filed March 29, 2000. 
Active peptides included 20E2 and RP6 (B6-like; Formula 2), S175 (A6-like; 
Formula 1), G33 (A6-like; Formula 1), RP9 (A6-like; Formula 1), D815 (Site 
2), and D8B12 (Site 2) peptides. The IGF-1 binding peptides were analyzed 
by various assays, described as follows. 
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A. Phage Competition 

Phage competition studies were performed with Site 1 (RP9) and Site 
2 (D815) monomer peptides. Plates were coated with IGF-1R (100 ng/well 
in carbonate buffer, pH 9.6) overnight at 4°C. Wells were blocked with 4% 
5 non-fat milk in PBS for 60 min at room temperature. One hundred 
microliters of rescued phage were added to each well. Peptides in varying 
concentrations were added and the mixtures were incubated for 2 h at room 
temperature. Plates were washed three times with PBS and 100 nl of anti- 
Mi 3 antibody conjugated to horseradish peroxidase was added to each 
10 well. The labeled antibody was incubated at room temperature for 60 min. 
After washing, 100 u.l of ABTS was added per well and the plates read in a 
microtiter reader at 450 nM. 

Phage included RP9 (A6-like; Formula 1); RP6 (B6-like; Formula 2); 
D8B12 (Site 2); and D815 (Site 2). Peptides included RP9 and D815. 

15 



Peptide 


Formula 


Site 
IGF-1R 


Sequence 


SEQ ID 

NO: 


D8B12 


6 


2 


WLEQERAWIWCEIQGSGCRA 


1884 


D815 


6 


2 


WLDQERAWLWCEISGRGCLS 


2206 


RP6 


2 


1 


TF YS CLAS LLTGTPQ P N RG PWERCR 


1635 


RP9 


1 


1 


GSLDESFYDWFERQLG 


1559 



Results shown in Figures 34A-34E demonstrate that that RP9 and 
D815 peptides competed both Site 1 and Site 2 phage. These results 
illustrate the allosteric nature of the interaction with IGF-1 R. 
20 Phage competition studies were also performed with Site 2-Site 1 

dimer peptides containing 6- or 12-amino acid linkers. Plates were coated 
with IGF-1 R (100 ng/well in carbonate buffer, pH 9.6) overnight at 4°C. 
Wells were blocked with 4% non-fat milk in PBS for 60 min at room 
temperature. One hundred microliters of rescued phage were added to 
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each well. Peptides in varying concentrations were added and the mixture 
incubated for 2 h at room temperature. Plates were washed three times with 
PBS and 100 nl of anti-M13 antibody conjugated to horseradish peroxidase 
was added to each well. The labeled antibody was incubated for 60 min at 
5 room temperature. After washing, 100 ^l of ABTS was added per well and 
the plates read in a microtiter reader at 450 nM. Phage included RP9, RP6, 
D8B12, and D815. Peptides included D815-6L-RP9 and D815-12L-RP9. 
Linker sequences are underlined and shown below. 



Peptide 


Formula 


Site 
IGF-1R 


Sequence 


SEQ 
ID NO: 


D815-6L- 
RP9 


6-1 


2-1 


LDQERAWLWCEISGRGCLSGGSGGSGSLDESFYDWFERQLGKK 


2207 


D815- 
12L-RP9 


6-1 


2-1 


WLDQERAWLWCEISGRGCLSGGSGGSGGSGGSGSLDESFYDW 
FERQLGKK 


2208 



10 D8B12, D815, RP6, and RP9 amino acid sequences are shown in the 

previous section. Results shown in Figures 35A-35E demonstrate that 
dimers competed both Site 1 and Site 2 phage. This indicates that both 
dimer units were active at IGF-1 R. 

B. IGF-1 Proliferation Assays 

15 FDC-P2 cells expressing the IL-3 and human IGF-1 R receptors were grown 
in RPMIk-1640 medium supplemented with 15% fetal bovine serum (FBS) 
and 5% WEHI conditioned medium (containing IL-3) in accordance with 
routine methods. Prior to an experiment, the cells were pelleted and 
washed two times in PBS. Following this, cells were resuspended in RPMI- 

20 1640 medium with 2% FBS and added to a 96-well plate at a concentration 
of 2 x 10 4 cells/well in 75 \x\. This was designated as the cell plate. 

Peptides were suspended in PPM 1-1 5% FBS (test medium). For the 
agonist assay, medium was added to rows 2-12 of a 96-well plate. The 
peptide was added to row 1 in 200 jal of test medium at a final concentration 

25 of 60 jaM. The peptide was serially diluted (1:1) across rows 2-11. No 
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peptide was added to row 12 (control; cells without IGF-1). For the 
antagonist assay, test medium containing 10 ng/ml IGF-1 (ED50 test 
medium) was added to all wells of a 96-well plate. To row 1 was added 100 
pi of peptide in ED50 test medium at a concentration of 120 jxM. The peptide 
5 was serially diluted (1:1) across rows 2-1 1 . No peptide was added to row 1 2 
(control; cells with IGF-1 ). 

For both agonist and antagonist assays, 75 uJ from the working plates 
was transferred to the appropriate rows in comparable cell plates. The 
starting peptide concentration for both agonist and antagonist assays was 
10 30 nM. Each peptide was done in duplicate. Plates were incubated at 37°C 
for 45-48 h. Ten microliters of WST-1 (Cell Proliferation Reagent, Roche cat 
# 1 644 807) were added to each well and the plates were read in an ELISA 
reader (440/700 dual wavelength) each hour for 4 h. Graphs were prepared 
from the raw data using Sigma Plot. Peptides included: 

15 



Peptide 


Formula 


Site 
IGF-1 R 


Sequence 


SEQ 
ID NO: 


20E2 


2 


1 


DYKD F YDAI DQLVRGSARAGGTRD 


2209 


D815 


6 


2 


WLDQERAWLWCEISGRGCLS 


2206 


G33 


1 


1 


GIISQSCPESFYDWFAGQVSDPWWCW 


1600 


RP6 


2 


1 


TFYSCLASLLTGTPQPNRGPWERCR 


1635 


RP9 


1 


1 


GSLDESFYDWFERQLG 


1559 


S175 


1 


1 


GRVDWLQRNANFYDWFVAELG 


1560 



Results of the IGF-1 proliferation assays are shown in Figures 36-42. 
Figure 36 demonstrates that that peptides G33 (Site 1; ED 50 ~ 10 u.M) and 
D815 (Site 2; ED 50 ~ 2 u.M) showed agonist activity at IGF-1 R, whereas 
20 peptides RP9 and RP6 showed no agonist activity. Figure 37 demonstrates 
that that peptides RP6 (Site 1 ; ED 50 - 1 uM) and RP9 (Site 1 ; ED 50 ~ 7 u.M) 
showed antagonist activity at IGF-1 R, whereas peptides G33 and D815 
showed no antagonist activity. Figure 38 demonstrates that peptides S175 
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and 20E2 exhibited weak agonist activity at IGF-1R (ED 50 > 10 ^iM). Figure 
39 shows that D815-RP9 dimers with 6- or 12-amino acid linkers acted as 
agonists at IGF-1R. Figure 40 shows that dimer peptide D815-6-G33 was 
inactive as an agonist at IGF-1R. Figure 41 shows that monomer peptide 
5 RP6 acted as an antagonist at IGF-1R. The IGF-1 standard curve 
determined for FDC-P2 cells is shown in Figure 42. 

The IGF-1 R data for the Site 1 and Site 2 peptides is summarized in 
Table 15, below. 
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Example 11: Panning Peptide Libraries for IGF-1 binding proteins 
A. Panning Secondary Libraries 

Soluble IGF-1 R ("slGF-1R") was obtained from R&D Systems. The 
soluble protein (> 95% pure) included the heterotetrameric (alpha 2-beta 2) 
extracellular domain of IGF-1 R isolated from a mouse myeloma cell line. 
slGF-1 R (500 ng/well) was added to an appropriate number of wells in a 96- 
well microtiter plate (MaxiSorp plates, NUNC) and incubated overnight at 
4°C. Wells were then blocked with MPBS (PBS buffer pH 7.5 containing 2% 
Carnation® non-fat dry milk) at room temperature (RT) for 1 h. Eight wells 
were used for each round of panning for the G33 and RP6 secondary 
libraries. The phage were incubated with MPBS for 30 min at RT, then 100 
pi was added to each well. 

For the first round, the input phage titer was 4 x 10 13 cfu/ml. For 
rounds 2 and 3, the input phage titer was approximately 10 11 cfu/ml. Phage 
were allowed to bind for 2 to 3 h at RT. The wells were then quickly washed 
13 times with 200 pl/well of MPBS. Bound phage were eluted by incubation 
with 100 pl/well of 20 mM glycine-HCI, pH 2.2 for 30 s. The resulting 
solution was then neutralized with Tris-HCI, pH 8.0. Log phase TG1 cells 
were infected with the eluted phage, then plated onto two 24 cm x 24 cm 
plates containing 2xYT-AG. The plates were incubated at 30° C overnight. 
The next morning, cells were removed by scraping and stored in 10% 
glycerol at -80° C. For subsequent rounds of affinity enrichment, cells from 
these frozen stocks were grown and phage were prepared as described 
above. A minimum of 72 clones was picked at random from the second, 
third, and fourth rounds of panning and screened for binding activity. DNA 
sequencing of the clones determined the amino acid sequences 
summarized in Figure 43A-43B. 
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B. Panning Peptide Dimer Libraries 

Microtiter plates were coated and blocked by standard methods, as 
follows. Plates were coated with sIGF-IR (see Example, above) or soluble 
IR (Bass construct; Bass ef a/., 1996, J. Biol. Chem. 271:19367-19375) in 
5 0.2 M NaHC0 3 , pH 9.4. One hundred microliters of solution containing 
either 50 ng IR or IGF-1 R (rounds 1 and 2), 25 ng IR or IGF-1R (round 3), or 
12.5 ng IR or IGF-1 R (round 4) was added to an appropriate number of 
wells in a 96-well microtiter plate (MaxiSorp plates, Nalge NUNC) and 
incubated overnight at 4°C. Wells were then blocked with a solution of 2% 

10 non-fat milk in PBS (MPBS) at RT for at least 1 h. 

Eight wells coated with IR or IGF-1 R were used for each round of 
panning. One hundred microliters of phage were added to each well. For 
the first round, the input phage titer was 3 x 10 13 cfu/ml. For subsequent 
rounds, the input phage titer was approximately 10 12 cfu/ml. Phage were 

15 incubated for 2-3 h at RT. The wells were then quickly washed 13 times 
with 300 ul/well of PBS. Bound phage were eluted by incubation with 150 
Ml/well of 50 mM glycine-HCI, pH 2.0 for 15 min. The resulting solution was 
pooled and then neutralized with Tris-HCI, pH 8.0. Log phase TG1 cells 
were infected with the eluted phage, in 2xYT medium for 1 h at 37°C prior to 

20 the addition of helper phage, ampicillin, and glucose (2% final 
concentration). 

After incubation for 1 h at 37°C, the cells were spun down and 
resuspended in 2xYT-AK medium. The cells were then returned to the 
shaker and incubated overnight at 37°C. Phage amplified overnight were 

25 then precipitated and subjected to the next round of panning. A total of 96 
clones were picked at random from rounds 3 and 4 and screened for binding 
activity. Several clones from each pan were further tested for binding to IR 
or IGF-1 R in phage ELISA by competition with soluble peptides as 
described in Beasley et al. International Application PCT/US00/08528, filed 

30 March 29, 2000, and Beasley et al., U.S. Application Serial No. 09/538,038, 
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filed March 29, 2000. Competition was performed by addition of 5 ^il of RP9 
peptide, recombinant D8 peptide, or both per well, followed by addition of 
100 |il of phage per well. Representative peptides are shown in Figures 
44A-44B and in Table 16, below. 
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C. Determination of Amino Acid Preferences 

For both monomer and dimer peptides, amino acid preferences for 
each peptide were determined as follows. The expected frequency of each 
of the 20 amino acids at that position was calculated based on codon usage 
5 and % doping for that library. This was then compared to the actual 
frequency of occurrence of each amino acid at every position after four 
rounds of biopanning. Any amino acid that occurred at a frequency >2-fold 
was considered preferred. Most preferred amino acid(s) were those that 
have the greatest fold enrichment after panning. Preferred amino acid 
10 sequences for RP9, D8, and Formula 10 (Group 6) peptides are shown 
below. 



TABLE 17 



Peptide 


Sequence 


SEQ ID NO: 


RP9 


GSLDESFYDWFERQLG 


1559 


Regular 


GLADEDFYEWFERQLR 
L 


2223 


w/ Peptide 


GQLDEDFYEWFDRQLS 
A 


2224 


w/ Insulin 


GFMDES FYEWFERQLR 
W A 


2225 



Table 17 shows preferred amino acid sequences for RP9 peptides. 

15 Residues in bold indicate strong preference; underlined residues indicate 
positions where more than one amino acid preference is seen. The first 
column indicates the conditions used for the panning procedure. "RP9" 
indicates sequence of the parent RP9; "Regular" indicates regular pan as 
described in methods for panning of random libraries; "w/ peptide" indicates 

20 panning in the presence of 2 nM RP9 peptide; "w/ insulin" indicates panning 
in the presence of 2 nM insulin. 
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TABLE 18 



Peptide 


Sequence 


SEQ ID NO: 


D8 Parent: 


WLDQEWAWVQCEVYGRGCPS 


2129 


Dimer Consensus 


sLEEEWaQIECEVY/WGRGCps 


2226 


Monomer Consensus 


sLEEEWaQI qCEIY/WGRGCry 
W 


1548 



Table 18 shows preferred amino acid sequences for D8 peptides. 
Upper case residues in bold indicate strong preference (>90% frequency); 
5 upper case letters, non-bold, indicate some preference (5-15% higher 
frequency than expected); lower case letters indicate less preference (2-5% 
higher frequency than expected); similar preferences seen in D8 in both 
monomer and dimer libraries. The underlined Y/W indicates that both 
residues are equally preferred at that position. In the original D8 sequence 
1 0 that position is occupied by Y. 



TABLE 19 



Peptide 


Sequence 


Type 


SEQ ID NO: 


Group 6 


W(A/E)GYEW(F/L) 


preferred core 


1549 


Group 6 


DSDWAGYEWFEEQLD 


preferred sequence 


1595 



Table 19 shows preferred amino acid sequences for Group 6 
peptides. Underlined residues indicate preferred N-terminal and C-terminal 
15 extensions. 

Example 12: Fluorescence-Based hlGF-1R Binding Assays 

A. Heterogeneous Time-Resolved Fluorometric Assays 

The effect of recombinant peptide G33 (rG33) on the binding of 
biotinylated-recombinant human IGF-1 (b-rhlGF-1) to recombinant human 
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IGF-1R (rhlGF-1R) was determined using heterogeneous time-resolved 
fluorometric assays (TRF; DELFIA®, PE Wallac, Inc.). The rhlGF-1R protein 
included the extracellular domain of the receptor pre-propeptide, up to 
amino acid residue 932 (A. Ullrich et al„ 1986, EMBO J. 5:2503-2512). 
5 Duplicate data points were collected at each concentration of competitor and 
the lines were designed to represent the best fit to a four-parameter non- 
linear regression analysis (y = min + (max-min)/(1+10 A ((loglC 50 - 
x)*Hillslope))) of the data, which was used to determine IC 50 values. 

The assay was performed using a 96-well clear microplate (NUNC 

10 MaxiSorp) with a final volume of 100 |jJ. Microtiter plates were coated with 
0.1 ^ig rhlGF-1R in 100 ptl of NaHC0 3 , pH 8.5 buffer, and incubated 
overnight at room temperature (RT). The plates were washed 3-times with 
0.05 M Tris-HCI (pH 8 at 25°C), 0.138 M NaCI, 0.0027 M KCI (TBS). This 
was followed by addition of 200 ^il blocking buffer (TBS containing 0.05% 

15 Bovine Serum Albumin (BSA, Cohn Fraction V)), and incubated for 1 h at 
RT. The plates were washed 6-times with a 1 X solution of Wallac's 
DELFIA® wash concentrate. Competitor was added in a volume of 50 |il and 
serially diluted across the microtiter plate in TBS containing 0.05% BSA. 
Non-specific binding (background) was determined in the presence of 60 

20 hlGF-1. 

Fifty microliters of b-rhlGF-1, 10 nM, diluted in TBS containing 0.05% 
BSA was added. The plates were incubated for 2 h at RT. After incubation, 
plates were washed 6-times with a 1X solution of Wallac's DELFIA® wash 
concentrate. Then the plates were treated with 100 pL of Wallac's DELFIA® 

25 Assay Buffer containing a 1:1000 dilution of europium-labeled streptavidin 
and incubated for 2 h at RT. This was followed by washing 6-times with a 1 
X solution of Wallac's DELFIA® wash concentrate. One hundred microliters 
of Wallac's DELFIA® enhancer was added, and the plates were shaken for 
30 min at RT. After shaking, the fluorescence signal at 620 nm was read on 

30 a Victor 2 1420 plate reader (PE Wallac, Inc.). Primary data were 
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background corrected, normalized to buffer controls, and then expressed as 
% specific binding. The Z'-factor was greater than 0.5 (Z' = 1-(3ct + +3ct.)/|h + - 
fi.|; Zhang et a/., 1999, J. Biomol. Screen. 4:67-73) and the signal-to- 
background (S/B) ratio was -20. The results of these experiments are 
5 shown in Figure 45. The IC 5 o value calculated for rG33 is shown in Table 
20, below. 

The effect of recombinant peptides D815 (rD815), RP9, D815-6aa- 
G33, D815-6aa-RP9, and D815-12aa-RP9 on the binding of b-rhlGF-1 to 
rhlGF-1R was determined using the fluorometric assay described above. 

10 IGF-1 was used as a control. Duplicate data points were collected at each 
concentration of competitor and the lines represent the best fit to a four- 
parameter non-linear regression analysis, which was used to determine IC50 
values. Results for rD815 are show in Figure 46; results for RP9 are shown 
in Figure 47; results for D815-6-G33 are shown in Figure 48; results for 

15 D815-6-RP9 are shown in Figure 49; and results for D815-12-RP9 are 
shown in Figure 50; the results for IGF-1 are shown in Figure 51 . The IC 50 
values for the rD815, RP9, D815-6aa-G33, D815-6aa-RP9, and D815-12aa- 
RP9 peptides, and IGF-1 are shown in Table 20, below. Linker sequences 
are underlined. 

20 TABLE 20 



Competitor 


Sequence 


SEQ ID 

NO: 


ICbo(M) 


rG33 


GIISQSCPESFYDWFAGQVSDPWWCW 


1600 


1.45 x 10"° M 


rD815 


WLDQERAWLWCEISGRGCLS 


2206 


4.08 x 10"° M 


RP9 


GSLDESFYDWFERQLG 


1559 


4.17 x 10" M 


D815-6aa-G33 


WLDQERAWLWCEISGRGCLSGGSGGSGIIS 
QSCPESFYDWFAGQVSDPWWCW 


2210 


6.24 x 10" M 


D815-6aa-RP9 


WLDQERAWLWCEISGRGCLSGGSGGSGSL 
DESFYDWFERQLGKK 


2211 


3.57 x 10"° M 


D815-12aa-RP9 


WLDQERAWLWCEISGRGCLSGGSGGSGG I 
SGGSGSLDESFYDWFERQLGKK 


2212 


3.22 x 10" y M 


IGF-1 






6.85 x 10* 1U M 
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The order of potency of all peptides or dimers compared to IGF-1 was 
determined as: IGF-1 > D815-12aa-RP9 » D815-6aa-RP9 > RP9 = D815- 
6aa-G33 > rG33 > rD815. These results suggest that the coupling of D815 
with RP9 using an extended linker (12 versus 6 amino acids) produced a 
5 potent competitor that approximates the affinity of IGF-1 for its own receptor. 

B. Time-Resolved Fluorescence Resonance Energy Transfer 
Assays 

The effect of Site 1 peptides, Site 2 peptides, and rhlGF-1 on the 
dissociation of biotinylated-20E2 (b-20E2, Site 1) from recombinant human 

10 IGF-1 R was determined using time-resolved fluorescence resonance energy 
transfer assays (TR-FRET). Best fit non-linear regression analysis of the 
data, was used to determine dissociation rate constants. Each data point 
represents a single observation. 

The assay was performed using a 96-well white microplate (NUNC) 

15 with a final volume of 100 pi. Final incubation conditions were 16.5 nM b- 
20E2, 2.2 nM SA-APC (streptavidin-allophycocyanin), 2.2 nM Eu 3+ -rhlGF-1R 
(LANCE™ labeled, PE Wallac, Inc.), 0.05 M Tris-HCI (pH 8 at 25°C), 0.138 
M NaCI, 0.0027 M KCI, and 0.1% BSA (Cohn Fraction V). Reactions were 
allowed to reach equilibrium for 6 h at RT. Following this, various peptides 

20 or IGF-1 were added at a final concentration of 100 |nM or 30 jiM, 
respectively. The addition of peptides or IGF-1 initiated the measurement of 
dissociation (Time Zero, sec). The fluorescence signal at 665 nm was read 
on a Victor 2 1420 plate reader (PE Wallac, Inc.) at 30 sec intervals. 

Results of these experiments are shown in Figure 52. The buffer 

25 controls did not vary over the time interval of study, which demonstrated that 
the equilibrium was not disturbed by the addition of diluent at Time zero. 
The addition of excess (> 1000-fold 20E2 K d for IGF-1 R) Site 1 peptides 
such as H2C, 20E2, or RP6 did not differ depending on specific the peptide 
used, and the dissociation rates of b-20E2 were similar for these peptides. 

30 D8B12 (Site 2 peptide) and IGF-1 (binds both Site 1 and Site 2) did 
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demonstrate significant differences in the rate of dissociation of b-20E2. 
This would suggest that these agents act as non-competitive or allosteric 
regulators of Site 1 binding. 

The effect of various peptides or peptide dimers on the binding of 
5 biotinylated-20E2 (B-20E2) to recombinant human IGF-1R was determined 
using TR-FRET assays, described above. For these experiments, each 
data point represents the average of two replicate wells. The lines represent 
the best fit to a four-parameter non-linear regression analysis (y = min + 
(max-min)/(1+10 A ((loglC 5 o-x)*Hillslope))) of the data, which was used to 

10 determine IC 5 o values. 

The assays were performed using a 384-well white microplate 
(NUNC) with a final volume of 30 Final incubation conditions were 15 nM 
b-20E2, 2 nM SA-APC, 2 nM Eu 3+ -rhlGF-1R (LANCE™ labeled, PE Wallac, 
Inc.), 0.05 M Tris-HCI (pH 8 at 25°C), 0.138 M NaCI, 0.0027 M KCI, and 

15 0.1% BSA (Cohn Fraction V). After 16-24 h of incubation at RT, the 
fluorescence signal at 665 nm and 620 nm was read on a Victor 2 1420 plate 
reader (PE Wallac, Inc.). Primary data were background corrected, 
normalized to buffer controls, and then expressed as % specific binding. 
The Z'-factor was greater than 0.5 (Z' = 1-(3a + +3a-)/|M+-^-|; Zhang et a/., 

20 1999, J. Biomol. Screen. 4:67-73) and the signal-to-background (S/B) ratio 
was ~ 4. Results of these experiments are shown in Figure 53. Table 21, 
below, shows the IC 5 o values calculated for these experiments. Notably, the 
C1 peptide showed IGF-1R affinities of -1 nM (Figure 53) and -10 nM 
(Table 21 ) in these assays. 

25 TABLE 21 



Competitor 


Sequence 


SEQ 
ID NO: 


Formula 


Site 
IGF-1 R 


IC 50 (M) 


C1 


CWARPCG DAAN F YD W FVQQ AS 


1550 


1 


1 


8.80E-10 


IGF-1 










2.93E-09 


RP9 


GSLDESFYDWFERQLG 


1559 


1 


1 


3.93E-08 
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20E2 


DYKDFYDAIDQLVRGSARAGGTRD 


2209 


2 


1 


1.04E-07 


E8 


GGTVWPGYEWLRNA 


2118 


10 


2 


2.53E-07 


H2C 


FHENFYDWFVQRVSKK 


2117 


1 


1 


4.60E-07 


S173 


LDALDRLMRYFEERPSL 


1830 


3 


1 


6.29E-06 


D8B12 


WLEQERAWIWCEIQGSGCRA 


1884 


6 


2 


1.13E-05 


A6 


SAKN F YD W FVKK 


1551 


1 


1 


3.10E-05 



C. Fluorescence Polarization Assays 

The effect of various peptide monomers and dimers on the binding of 
fluorescein-RP9 (FITC-RP9) to soluble human insulin receptor- 
immunoglobulin heavy chain chimera (sIR-Fc; Bass et al., 1996, J. Biol. 
5 Chem. 271:19367-19375) was determined using fluorescence polarization 
assays (FP). For these experiments, each data point represents the 
average of two replicate wells. The lines represent the best fit to a four- 
parameter non-linear regression analysis of the data, which was used to 
determine IC 5 o values. 

10 The assays were performed in a 384-well black microplate (NUNC) 

with a final volume of 30 uJ. Final incubation conditions were 1 nM FITC- 
RP9, 10 nM SIR, 0.05 M Tris-HCI (pH 8 at 25°C), 0.138 M NaCI, 0.0027 M 
KCI, 0.05% BGG (bovine gamma globulin), 0.005% Tween-20®. After 16-24 
h of incubation at RT, the fluorescence signal at 520 nm was read on an 

15 Analyst™ AD plate reader (LJL BioSystems, Inc.). Primary data were 
background corrected using 10 nM sIR without FITC-RP9 addition, 
normalized to buffer controls and then expressed as % specific binding. The 
Z'-factor was greater than 0.5 (Z = 1-(3a + +3cr.)%+-M-|; Zhang et al., 1999, J. 
Biomol. Screen. 4:67-73) and the assay dynamic range was -125 mP. In 

20 parallel with these experiments, TR-FRET assays were performed using 
rhlGF-1R and b-20E2, as described above. Results of the FP and TR-FRET 
experiments are shown in Table 22, below. 
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TABLE 22 



Peptide 


FP 

sIR-Fc 


TR-FRET 
rhlGF-1R 


Bndg Ratio 
IGF-1R/IR 


Form. 


Site 
IGF-1 R 


SEQ ID 

NO: 


Sequence 


RP4 


17 


8100 


476 


2 




1552 


PPWGARFYDAIEQLVFDNL 


S175 


10 


1650 


165 


1 




1560 


GRVDWLQRNANFYDWFVA 
ELG 


RP15 


28 


706 


25 


1 


1 


2130 


SQAGSAFYAWFDQVLRTV 


H2C (D117) 


66 


600 


9 


1 


1 


2117 


FHENFYDWFVQRVSKK 


20E2(D118) 


51 


100 


1.9 


2 




2209 


DYKDFYDAIDQLVRGSARA 
GGTRD 


RP9 


24 


33 


1.4 


1 




1559 


GSLDESFYDWFERQLG 


G33 


139 


178 


1.3 


1 




1600 


GIISQSCPESFYDWFAGQV 
SDPWWCW 


E8 (D120) 


206 


175 


0.85 


10 


2 


2118 


GGTVWPGYEWLRNA 


C1 (D112) 


52 


10 


0.19 


1 


1 


1550 


CWARPCGDAANFYDWFV 
QQAS 


RP16 


6400 


961 


0.15 






1553 


VMDARDDPFYHKLSELVT 



FP sIR-Fc column shows IC 50 (nM) values obtained (vs. FITC-RP9); TR-FRET rhlGF-1R 
column shows IC 50 (nM) values obtained (vs. b~20E2); for binding ratio: higher values 
indicated higher affinity for IR than IGF-1 R. Form. = formula; Bndg. = binding. 



These results demonstrated that S175, RP4, and RP15 showed high 
affinities for IR and showed high binding ratios for IGF-1 R over IR. H2C, 
20E2, RP9, and C1 were slightly less potent than S175, RP4, and RP15 at 
10 IR, and these peptides had lower binding ratios for IGF-1 R over IR. G33 
and E8 were less potent than S175, RP4, and RP15 at IR, and showed 
comparable binding to IGF-1 R and IR. RP16 had poor potency at IR and 
IGF-1 R, but had higher affinity for IGF-1 R than IR. 

Example 13: IGF-1 R Binding Peptides - Additional Isolates 

15 The isolation and characterization of peptides which bind to and 

subdivide the insulin receptor binding site into multiple, non-overlapping 
regions designated Site 1 and Site 2 has been previously described 
(Beasley et a/., U.S. Application Serial No. 09/538,038, filed March 29, 2000, 
published as WO 01/72771; Pillutla et aL, U.S. Patent Application Serial No. 
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09/962,756, filed September 24, 2001; Pillutla et a/., 2002, J. BioL Chem. 
277:22590-22594). To identify IGF-1R antagonists, a multi-tiered approach 
was used. First, Site 1 peptides with greater selectivity for IGF-1R as 
compared to IR were identified. Second, secondary libraries were 
5 generated using information from the primary library pannings. These 
secondary libraries were designed to define the amino acid requirements for 
binding, specificity, and affinity. 

To determine optimal sequence requirements within the motif, a 
secondary library based on a clone identified from the random library was 

10 made where the flanking regions were held constant, while the core was 
allowed to change. The library was prepared from doped oligonucleotides 
so that half of the amino acid residues (on average) in the core sequence 
were altered per peptide. Panning of these libraries identified substitutions 
within the core that did or did not affect binding. In an alternative approach, 

15 amino acids in the flanking regions conferring binding affinity and/or 
specificity were defined by designing secondary libraries wherein the core 
was held constant and the flanking sequences were either doped or 
randomized. For both types of libraries, amino acids optimal for binding 
were selected by panning against IGF-1R. Once secondary peptides with 

20 the appropriate binding characteristics were identified, a preferred peptide 
was defined. To do this, the amino acids at each position were optimized 
based on a comparison of the expected results from the doping strategy and 
the actual results observed in the binding population. 

A. Primary Peptide Libraries 

25 The E. coli, strain TG1 (genotype = K12 A(lac-pro), supE, thi, 

hsdA5/F'[traD36, proAB, lacF, lacZAM15\) was obtained from Pharmacia 
(Piscataway NJ). DNA fragments coding for peptides containing 40 random 
amino acids were generated by a PCR-technique using synthetic 
oligonucleotides. A 145-base oligonucleotide was synthesized to include 

30 the sequence (NNK) 4 o where N = A, C, T, or G and K = G or T. This 
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oligonucleotide was used as the template in PCR reactions along with two 
shorter oligonucleotide primers, both of which were biotinylated at their 5' 
ends. The resulting product was purified, concentrated, and ligated to the 
phagemid pCANTABSE (Pharmacia). The ligation product was purified and 
5 electroporated into competent bacterial cells. The transformants were 
grown at 37°C for 1 h, pooled and plated onto selection medium. Depending 
upon the amount of DNA electroporated, the diversity of the random 40mer 
peptide cell library was found to be between 1.6 X 10 10 and 1 x 10 11 
independent clones. The phage library was produced by rescue of the cell 

10 library according to standard phage preparation protocols (G.P. Smith and 
J.K. Scott, 1993, Methods Enzymol. 217:228-257). Phage titers were 
usually at 4 X 10 13 cfu/ml. In previous experiments, sequencing of randomly 
selected clones from the cell library indicated that about 54% of all clones 
were in-frame. The short FLAG sequence (DYKD; SEQ ID NO:1545), was 

15 included at the N-terminus as an immunoaffinity tag. In addition, the E-tag 
epitope (GAPVPYPDPLEPR; SEQ ID NO:XX) was engineered into the 
carboxy terminus of the peptide. Additional random phage libraries of 
20mer peptides were constructed using a similar approach. The diversity of 
these cell libraries was estimated to be > 1.1 X 10 11 clones. 

20 B. Secondary and Tertiary Libraries 

The desired number of amino acid mutations were introduced in the 
parental peptide at the codon level when the synthetic DNA template was 
produced. For example, where a change in 45% of the amino acids was 
desired (i.e., 9 changes/20 amino acids), then a 60% change at the codon 
25 level was needed due to the redundancy of the genetic code (efficiency 
factor of 0.75). Per position, this translated to 20% doping at the level of 
DNA synthesis. At the DNA synthesis level, a 20% doping included the 
following ratio of nucleotides in the synthetic template: 

A 80% A, 6.6% C, 6.6% G, 6.6% T 
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C 6.6% A 80% C, 6.6% G, 6.6% T 
G 6.6% A 6.6% C, 80% G, 6.6% T 
I 6.6% A 6.6% C, 6.6% G, 80% T 
In this chart, the A, C, I, G (underlined and in bold) bases represent 
5 the original bases in the parental sequence. When the clones from cell 
libraries were sequenced and the number of amino acid mutations was 
examined per peptide, the average number of changes was found to 
correlate to the desired value. After the synthetic template was obtained, 
the DNA was ligated to the pCANTBA5E phagemid vector to produce the 
10 cell library in the TGI strain as previously described. Phage rescue was 
carried out to produce the phage library used in the panning experiments. 

C. Panning of peptide libraries 

A standard method was used to coat and block all microtiter plates. 
Plates were coated with IGF-1R in 0.2 M NaHC0 3 , pH 9.4. One hundred 
15 microliters of solution containing 100 ng of IGF-1R was added to an 
appropriate number of wells in a 96-well microtiter plate (MaxiSorp plates, 
Nunc) and incubated overnight at 4°C. Wells were then blocked with a 
solution of 2% non-fat milk in PBS (MPBS) at room temperature (RT) for at 
least 1 h. 

20 Four to eight wells coated with IGF-1R were used for each round of 

panning. One hundred microliters of phage were added to each well. For 
the first round, the input phage titer was ~10 13 cfu/ml. For subsequent 
rounds, the input phage titer was approximately 10 12 cfu/ml. Phage were 
allowed to bind for 2-3 h at RT. The wells were then quickly washed 13 

25 times with 300 ul/well of PBS. Bound phage were eluted by incubation with 
150 ul/well of 50 mM glycine-HCI, pH 2.0 for 5 min. The resulting solution 
was pooled and then neutralized with Tris-HCI, pH 8.0. 

Log phase TG1 cells were infected with the eluted phage, in 2xYT 
medium for 1 h at 37°C prior to the addition of helper phage, ampicillin and 

30 glucose (2% final concentration). After incubation for another hour at 37°C, 



WO 03/027246 



PCT/US02/30412 



-160- 

the cells were spun down and resuspended in 2xYT-AK medium. The cells 
were then returned to the shaker and incubated overnight at 37°C. Phage 
amplified overnight was then precipitated and subjected to the next round of 
panning. A total of 96 clones were picked at random from rounds 3 and 4 
5 and screened for binding activity. 

D. ELISA Analyses of Phage 

For phage pools, cells from frozen stocks were grown and phage 
were prepared as described above. For analysis of individual clones, 
colonies were picked and phage prepared as described above. Subsequent 

10 steps were the same for pooled and clonal phage. Microtiter wells were 
coated and blocked as described above. Wells were coated with either IGF- 
1R or IR. Phage resuspended in MPBS (PBS containing 2% non-fat milk) 
were added to wells (100 |al/well) and incubated at room temperature for 1 h. 
The phage solution was then removed, and the wells were washed three 

15 times with PBS at room temperature. 

Anti-M13 antibody conjugated to horseradish peroxidase (Pharmacia 
Biotech) was diluted 1:3000 in MPBS and added to each well (100 jal/well). 
Incubation was for another hour at room temperature, followed by PBS 
washes as described. Color was developed by addition of ABTS solution 

20 (100 jil/well; Boehringer). Color development was stopped by adjusting 
each well to 0.5% SDS. Plates were analyzed at 405 nm using a 
SpectraMax 340 plate reader (Molecular Devices) and SoftMax Pro 
software. Data points were averaged after subtraction of appropriate 
blanks. A clone was considered "positive" if the A405 of the well was > 2-fold 

25 over background. 

E. Determination of Amino Acid Preferences 

Amino acid preferences for each peptide were determined as follows. 
The expected frequency of each of the 20 amino acids at that position was 
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calculated based on codon usage and % doping for that library. This was 
then compared to the actual frequency of occurrence of each amino acid at 
every position after four rounds of biopanning. Any amino acid that occurred 
at a frequency >2-fold was considered preferred. The most preferred amino 
5 acid(s) were defined as those with the greatest enrichment after panning. 
Using the amino acid preferences determined for each position, peptides 
with the most preferred sequences were designed. 

Representative monomer and dimer peptides identified by panning 
secondary libraries for binding to IGF-1R are shown in Figures 54A-54B, 
10 55A-55B, 56A-56B, 57A-57B, 58A-58B, 59A-59B, 60A-60C, 61A-61B, 62A- 
62B, 63A-63B, and 64A-64B. Primary library pannings produced several 
peptides, including RP6, RP48, RP52, RP54, RP56, and RP60, described 
above. Peptides designed according to amino acid preferences (i.e., 
peptides by design) included RP30-IGF, RP31-IGF, and RP33-IGF. 

15 Example 14: IGF-1 Antagonist Peptides 

A. Cells and Reagents 

MCF-7 and MiaPaCa cell lines were obtained from the American 
Type Culture Collection (Manassas, VA). Cells were routinely grown in 
RPMI1640 medium supplemented with 10% fetal bovine serum and 1% 
20 glutamax. The extra-cellular domain of IGF-1 R was obtained as a 
recombinant protein from R&D Systems (Minneapolis, MN). 

B. Whole-cell lysates 

For qualitative IRS-1 phosphorylation analysis, MCF-7 cells in 
monolayer cultures (about 80% of confluency) were used. After about 20 h 
25 of starvation in serum-free RPMI medium (GibcoBRL), cells were stimulated 
for 10 min in the same medium containing IGF-1 (Peprotech), or IGF-1 plus 
peptides (synthetic peptides produced by Research Genetics), or no 
addition as a negative control. After treatment, cells were rinsed twice with 
ice-cold PBS containing 0.2 mM PMSF and 1 mM Na 3 V0 4 (all from SIGMA). 
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Cells were scraped into the same buffer and pelleted by centrifugation at 
200 x g for 3 min. Lysis was done in RIPA buffer (0.8766% NaCI, 0.11% 
SDS, 0.5% deoxycholic acid (all from SIGMA), 1% Triton X-100, (Boehringer 
Mannheim)) containing phosphatase inhibitor cocktails 1 and 2 (SIGMA) and 
5 protease cocktail inhibitor tablet (Boehringer Mannheim) for 5 min on ice. 
Cell lysates were cleared by centrifugation for 5 min at 14 000 x g and the 
resulting supernatant was snap-frozen in ethanol-dry ice and stored at - 80° 
C. The protein concentration was determined using the D c Protein Assay Kit 
(Bio-Rad Laboratories). 

1 0 C. Immunoprecipitation and Western Blot Analysis 

Immunoprecipitations were performed with pre-cleared lysates for 4 h 
at 40° C using 0.3-0.5 mg total protein with 1 jag polyclonal anti-IRS-1 
antibody (Upstate Biotechnology), and 25 |xl protein A/agarose slurry 
(SIGMA). Agarose beads with immobilized proteins were washed 3 times 

15 with IP wash buffer (50 mM Tris pH 7.5 (GibcoBRL), 150 mM NaCI, 1 mM 
Na 3 V0 4 , 0.2 mM PMSF). Protein elutions and denaturation were done for 3 
min at 95° C in 30 jxl of Laemmle sample buffer (Bio-Rad Laboratories) with 
0.5 M p-mercaptoethanol (SIGMA). 

Immunoprecipitates were subjected to SDS-PAGE on 4-15% Tris-HCI 

20 Ready Gels and transferred to Trans-Blot Transfer Medium nitrocellulose 
membranes (both from Bio-Rad Laboratories). Membranes were blocked 
with PBS-Tween 20 (SIGMA) containing 2% non-fat milk. For detection of 
IRS-1 protein, blots were incubated with anti-IRS-1 antibody, followed by 
secondary antibody goat anti-rabbit IgG, HRP-conjugate. For detection of 

25 phosphorylated IRS-1, blots were incubated with monoclonal anti- 
phosphotyrosine (4G10) HRP-conjugated antibody. All antibodies were 
obtained from Upstate Biotechnology. Blots were exposed to an enhanced 
chemifluorescence substrate (ECL Western Blotting Analysis System, 
Amersham Pharmacia Biotech). Films were developed and fluorescent 

30 signal was visualized for qualitative analysis. 
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D. MCF-7 and MiaPaCa Celt Assays 

Peptides produced synthetically were maintained as 30 mM stock in 
100% DMSO, while recombinant dimers were diluted in water. All synthetic 
and recombinant peptides were stored at -80°C. The final concentration of 
5 DMSO was < 0.1%. MCF-7 and MiaPaCa (ATCC, Rockville, MD) cells were 
maintained in RPMI containing 10% FBS. All cells were starved overnight 
by growing them in RPMI media, which was serum free. Cells were 
trypsinized, washed twice with PBS before being seeded at 1-3 x 10 3 cells 
per well in a 96-well plate with a volume of 150 |il/well. All points were done 

10 in duplicate in 96-well plates. For antagonist activity assays, immediately 
before the addition of peptides, all media was gently removed from the 
wells. Peptides were serially diluted 1:2 in a final volume of 150 jil in a 
separate plate using RPMI containing 0.1% FBS plus 50 nM IGF-1. This 
mixture was transferred onto the cells, and the plates were incubated for 72 

15 h at 37°C in a C0 2 incubator. To quantitate cell number, 10 jal of WST-1 
reagent (Roche Molecular Biochemicals, Indianapolis, IN) was added to 
each well and the plates were returned to the 37°C/C0 2 incubator for 
approximately 2 h. Measurements were then read at 440 nm, with 700 nm 
used as a reference. 

20 E. Binding (ALPHAScreen) Assays 

To assay binding, the relative potencies of peptides as compared to 
IGF-1 were analyzed in a competition system utilizing biotinylated-human 
IGF-1 (b-hlGF-1) and His-tagged soluble recombinant human IGF-1 R 
(srhlGF-1R-his; R&D systems, Inc., Minneapolis, MN). Detection of the 
25 receptor ligand interaction was measured in an amplified luminescent 
proximity homogeneous assay (ALPHAScreen; BioSignal-Packard, 
Montreal). The assay was performed in 384-well Nunc™white polystyrene 
microplates (Nalge Nunc International, Naperville, IL) with a final volume of 
40 ixl. Final incubation conditions were 1 nM b-hlGF-1, 10 nM srhlGF-1R- 
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his, 0.025 M HEPES (pH 7.4 at 25°C), 0.100 M NaCI, 0.1% BSA (Cohn 
Fraction V; Sigma Chemical Co., St. Louis, MO), 10 ^g/ml nickel conjugated 
acceptor beads, and 10 jag/ml streptavidin conjugated donor beads. 

For the first step of the assay, hlGF-1 (PeproTech, Inc., Rocky Hill, 
5 NJ), b-hlGF-1 (see below), and peptides were incubated for 2 h at room 
temperature. Each concentration of competitor was assayed in duplicate. 
Non-specific binding was determined in the presence of 3 x 10~ 5 M hlGF-1. 
In the second step of the assay, acceptor beads were added and the 
incubation was continued for 0.5 h. In the final step, donor beads were 

10 added and the incubation was continued for an additional 1 h. At the end of 
the incubation period, the fluorescence signal at 520 nm was read on a 
Fusion-a HT plate reader (Packard Bioscience Company, Meriden, CT). 
Primary data were background corrected, normalized to buffer controls, and 
then expressed as % specific binding. The data were fit to a four-parameter 

15 non-linear regression analysis ( y = min + (max-min)/(1+10 A ((loglC50- 
x)*Hillslope)) ), which was used to determine IC 50 values. The Z'-factor for 
this assay was greater than 0.7 (Z = 1-(3a++3a-)%+-|a-|) and the signal-to- 
background (S/B) ratio was between 40 and 70. 

Human IGF-1 was biotinylated on free amino groups using Pierce EZ- 

20 Link™ Sulfo-NHS-LC-Biotinylation Kit (PN #21430, Pierce, Rockford, IL). 
Human IGF-1, at 2 mg/ml in PBS, pH 7.2, was incubated at room 
temperature for 30 min with a 20-fold excess of sulfo-NHS-LC-biotin over 
theoretical total free amino groups. Unreacted biotins were removed by 
extensive dialysis (Pierce Slide-A-Lyzer® Dialysis Cassettes) against PBS, 

25 and degree of conjugation was determined by HABA (2-(4- 
hydroxyazobenzene) benzoic acid) assay (Pierce product literature #21430). 
Number of biotins per hlGF-1 varied between 3 and 5. 
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F. FDC-P2 Cell Assays 

Peptides produced synthetically were maintained as 30 mM stock in 
100% DMSO, while recombinant dimers were diluted in water. All synthetic 
and recombinant peptides were stored at -80°C. The final concentration of 
5 DMSO was < 0.1%. FDC-P2 (obtained from Dr. J. Pierce, National 
Institutes of Heath, Bethesda, MD) cells were maintained in RPMI containing 
15% FBS and 5% WEHI (Genoquest, Germantown, MD) at 37°C in a C0 2 
incubator. To initiate experiments, all cells were starved for 5 h in RPMI 
containing 1% FBS. Cells were seeded at 1 x 10 4 cells per well into 96-well 
10 plates at a volume of 75 ul/well. Peptides were added at 2X final 
concentrations and all points were done in duplicate. For antagonist assays, 
peptides at 2X concentration were serially diluted 1 :2 in a final volume of 75 
ul in a separate plate using RPMI containing 0.1% FBS and 1 nM IGF-1. 
This mixture was transferred onto the cells to yield a final volume of 150 ul. 
15 The plates were incubated for 48 h at 37°C in a C0 2 incubator. To 
quantitate cell number, 10 jal of WST-1 reagent (Roche Molecular 
Biochemicals, Indianapolis, IN) was added to each well and the plates were 
returned to the 37°C/C0 2 incubator for approximately 2 h. Measurements 
were then taken at 440 nm, with 700 nm used as a reference. 

G. Results 

Peptide RP33-IGF exhibited an affinity for IGF-1 R close to that of 
IGF-1 (9 nM; Table 23). Other peptides, such as RP54 showed affinity in 
the micromolar range (Table 23). In contrast to the observations made for 
IR, competition experiments indicated that IGF-1 R Site 1 and 2 peptides 
were able to compete with each other. This suggested that the functional 
interactions between Site 1 and Site 2 in IGF-1 R differed from those found in 
IR (unpublished data). 

To determine if any Site 1 peptides could act as antagonists, 
proliferation assays were established utilizing IGF-1 and IGF-2 responsive 



20 



25 
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human tumor cell lines. Sixteen human tumor cell lines were screened for 
their ability to proliferate in the presence IGF-1 and IGF-2 under serum-free 
conditions. Two cell lines, MCF-7 (breast carcinoma) and MiaPaCa 
(pancreatic carcinoma), showed the best dose response curves for IGF-1 
5 (ED 5 o = 5 nM; Figures 65A-65F), and were used for subsequent 
experiments. 

Peptides were synthesized and screened in the proliferation assay at 
an IGF-1 dose ten times the ED50 (50 nM). Several antagonist peptides 
were identified, including RP33-IGF, which consistently blocked IGF-1 and 

10 IGF-2 proliferation of both MCF-7 and MiaPaCa (Figures 66B-66C). In 
addition, peptides RP52 and RP54 were found to act as antagonists in at 
least one cell line (Table 26; Figures 70A-70B). Peptides RP52 and RP54 
are classified as miscellaneous peptides, which were not categorized into 
any of the formulae (e.g., Formula 1, Formula 2, etc.) disclosed herein. 

15 Experiments were then performed to determine whether antagonist 

peptides could block receptor activation at the level of key signaling 
intermediate, IRS-1. First, the optimal time and concentration of IGF-1 
needed for maximal activation of IRS-1 was established (Figures 67A-67B 
and Figures 68A-68B). Maximum phosphorylation of IRS-1 was observed 

20 after 10 min of treatment and was followed by a drop-off of the signal 
(Figures 67A-67B). This pattern was presumably due to degradation of the 
IRS-1 protein by a mechanism involving proteasomes (Lee etal., 2000, Mol. 
Cell. Biol., 2000, 20:1489-1496). Second, RP33-IGF was compared to two 
unrelated peptides. The RP33-IGF peptide inhibited IRS-1 phosphorylation, 

25 whereas the unrelated peptides had no effect in the proliferation assay 
(Figures 69A-69B). 

The RP6KK peptide was also tested for activity, since the RP33-IGF 
peptide was originally derived from the RP6KK sequence. Both RP6KK and 
RP33-IGF were found to effectively block activation of IRS-1 by IGF-1 

30 (Figures 69A-69B). At the concentration used, greater than 90% of the 
protein was unphosphorylated, indicating that both peptides efficiently 
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blocked IGF-1 R activation. However, RP33-IGF differed from RP6KK by 1 1 
amino acids, and RP33-IGF was a superior IGF-1 R antagonist in the cell 
proliferation assays (Tables 24-25). The difference in biological activity did 
not appear to be related to stability of the peptides since both were found to 
5 remain intact during the course of the assays (unpublished data). 
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Example 15: IGF-1 Agonist Peptides 

A. MCF-7 and MiaPaCa Cell Assays 

Peptides produced synthetically were maintained as 30 mM stock in 
100% DMSO, while recombinant dimers were diluted in water. All synthetic 
5 and recombinant peptides were stored at -80°C. The final concentration of 
DMSO was < 0.1%. MCF-7 and MiaPaCa (ATCC, Rockville, MD) cells were 
maintained in RPMI containing 10% FBS. All cells were starved overnight 
by growing them in serum-free RPMI media. Cells were trypsinized, washed 
twice with PBS before being seeded at 1-3 x 10 3 cells per well in a 96-well 
10 plate in a volume of 150 jal/well. All points were done in duplicate in 96-well 
plates. For agonist activity assays, immediately before the addition of 
peptides, all media was gently removed from the wells. Peptides were 
serially diluted 1:2 in a final volume of 150 ^l in a separate plate using RPMI 
containing 0.1% FBS. The diluted peptide solutions were transferred onto 
15 the cells, and the plates were incubated for 72 h at 37°C in a C0 2 incubator. 
To quantitate cell number, 10 ^il of WST-1 reagent (Roche Molecular 
Biochemicals, Indianapolis, IN) was added to each well and the plates were 
returned to the 37°C/C0 2 incubator for approximately 2 h. Measurements 
were then taken at 440 nm, with 700 nm used as a reference. 

B. FDC-P2 Cell Assays 

Peptides were maintained and stored as indicated above. FDC-P2 
cells (obtained from Dr. J. Pierce, NIH) were maintained in RPMI containing 
15% FBS and 5% WEHI (Genoquest, Germantown, MD) at 37°C in a C0 2 
incubator. To initiate experiments, all cells were starved for 5 h in RPMI 
containing 1% FBS. Cells were seeded at 1 x 10 4 cells per well into 96-well 
plates at a volume of 75 ^il/well. Peptides were added at 2X final 
concentration and all points were done in duplicate. For agonist assays, 
peptides at 2X concentration were serially diluted 1:2 in a final volume of 75 



20 



25 
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|il in a separate plate using RPMI containing 0.1% FBS. The diluted peptide 
solutions were transferred onto the cells to yield a final volume of 150 jd. 
The plates were incubated for 48 h at 37°C in a C0 2 incubator. To 
quantitate cell number, 10 fal of WST-1 reagent (Roche Molecular 
5 Biochemicals, Indianapolis, IN) was added to each well and the plates were 
returned to the 37°C incubator for approximately 2 h. Measurements were 
taken at 440 nm, with 700 nm used as a reference. 

For these experiments, potencies of peptide competition were 
determined using the AlphaScreen assay format. Primary data were 

10 background corrected, normalized to buffer controls and then expressed as 
% specific binding. The data were fit to a four-parameter non-linear 
regression analysis ( y = min + (max-min)/(1+10 A ((loglC 5 o-xrHillslope)) ), 
which was used to determine IC 50 values. The Z'-factor for this assay is 
greater than 0.7 (Z' = 1-(3<j + +3a.)/|(i + -|a-|) and the signal-to-background (S/B) 

1 5 ratio was between 40 and 70. 

C. Results 

Several IGF-1R agonist peptides were identified which consistently 
stimulated proliferation of both MCF-7 and MiaPaCa cells (Tables 27-28; 
Figures 73A-73D and Figures 74A-74I). Monomer peptides with IGF-1R 

20 agonist activity included RP60, RP48, G33, C1, and L-RP9ex (Tables 27- 
28). Dimer peptides with IGF-1R agonist activity included RP30-IGF-12- 
D112, RP30-IGF-12-RP31-IGF, RP31-IGF-12-RP30-IGF, D1 12-12-RP30- 
IGF, RP6-L-D8B12, D8B12-12-RP9, D1 12-12-D1 12, RP9-12-RP9, and 
RP9-L-RP6 (Tables 27-28). Agonist peptides were also identified using 

25 FDC-P2 cell proliferation assays (Table 29). Monomer peptides with IGF-1R 
agonist activity included G33-lig, G33, S175, D815, lig-D815, RP31-IGF, 
and D815 (Table 29). Dimer peptides with IGF-1R agonist activity included 
RP6-RP9, G33-6-G33, and D815-RP9 (Table 29). 
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ln addition, peptides with agonist or antagonist activity in MCF-7 or 
MiaPaCa cell proliferation assays were shown to compete against IGF-1 for 
binding to IGF-1 R (Figures 71A-71F and Figures 72A-72E). Potencies of 
peptide competition were determined using the AlphaScreen assay format 
5 for peptide monomers RP60, RP48, sG33, L-RP9ex, and 12-RP9ex (Figures 
71A-71F). Potencies were also determined for dimer peptides rRP30-IGF- 
12-D112, rRP30-IGF-12-RP31-IGF, rRP31-IGF-12-RP30-IGF, rD1 12-12- 
RP30-IGF, and rD1 12-12-D1 12 (Figures 72A-72E). 

The biological response of the monomers and dimers in the FDC-P2 

10 (myeloid cells; IGF-1 R/IGF-1R receptor), MCF-7 (breast cancer cells; hybrid 
IGF-1 R/IR receptor) and MiaPaCa (pancreatic cancer cells; hybrid IGF- 
1R/IR receptor) assays were compared (Table 30). In some instances, a 
modulatory effect (agonism or antagonism) was seen in certain cell lines but 
not in others. For example, the RP30-IGF peptide exhibited antagonist 

15 activity in FDC-P2 and MiaPaCa cells, but not in MCF-7 cells (Table 30). 
The C1 peptide exhibited antagonist activity in FDC-P2 and MCF-7 cells, but 
not in MiaPaCa cells. The RP9-RP6, L-RP9ex, and D8B12-12-RP9 
peptides exhibited either antagonist or agonist activity depending on the cell 
line used (Table 30). Therefore, it is possible to use the peptides of the 

20 invention to target specific cell types with specific modulatory effects. 
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Incorporated herein by reference in its entirety is the Sequence 
Listing for the application, comprising SEQ ID NO:1 to SEQ ID NO:2227. 
The Sequence Listing is disclosed on three CD-ROMs, designated "CRF", 
"Copy 1", and "Copy 2". The Sequence Listing is a computer-readable 
5 ASCII file named "18784056PC.app.txt", created on September 23, 2002, in 
IBM-PC machine format, on a MS-Windows®98 operating system. The 
18784056PC.app.bct file is 927,477 bytes in size. 



As various changes can be made in the above compositions and 
10 methods without departing from the scope and spirit of the invention, it is 
intended that all subject matter contained in the above description, shown in 
the accompanying drawings, or defined in the appended claims be 
interpreted as illustrative, and not in a limiting sense. 

15 The contents of all patents, patent applications, published articles, 

books, reference manuals, texts and abstracts cited herein are hereby 
incorporated by reference in their entirety to more fully describe the state of 
the art to which the present invention pertains. 
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WHAT IS CLAIMED IS: 

1 . A method of modulating insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
modulate the activity of insulin-like growth factor receptor, wherein i) the 
amino acid sequence comprises a Formula 1 sequence, X 1 X 2 X3X 4 X5, such 
that Xi, X 2 , and X 5 are selected from the group consisting of phenylalanine 
and tyrosine, X 3 is selected from the group consisting of aspartic acid, 
glutamic acid, glycine and serine, and X* is selected from group consisting 
of tryptophan, tyrosine and phenylalanine; and ii) with the proviso that the 
amino acid sequence is not insulin, insulin-like growth factor, an anti-insulin 
receptor antibody, an anti-insulin-like growth factor receptor antibody, or 
fragments thereof. 

2. The method of claim 1, wherein the amino acid sequence modulates 
insulin-like growth factor receptor activity in mammalian cells selected from 
the group consisting of breast cancer cells, pancreatic cancer cells, and 
myeloid cells. 

3. The method of claim 2, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor antagonist sequence selected from the 
group consisting of: 

a. H2C-A-H6 (SEQ ID NO:2228); and 

b. RP9 (SEQ ID NO:2229). 

4. The method of claim 2, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor agonist sequence selected from the 
group consisting of: 

a. S175 (SEQ ID NO:2248); and 

b. 12-RP9ex (SEQ ID NO:2250). 
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5. The method of claim 2, wherein the amino acid sequence further 
comprises two or more cysteines which are separated by at least 3 amino 
acids. 

6. The method of claim 5, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor antagonist sequence selected from the 
group consisting of: 

a. C1 (SEQ ID NO:2230); 

b. C1 KK (SEQ ID NO:2266); and 

c. L-RP9ex (SEQ ID NO:2231 ). 

7. The method of claim 5, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor agonist sequence selected from the 
group consisting of: 

a. G33 (SEQ ID NO:2249); and 

b. L-RP9ex (SEQ ID NO:2231 ). 

8. A method of decreasing insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
decrease the activity of insulin-like growth factor receptor, wherein i) the 
amino acid sequence comprises a Formula 2 sequence, 
X6X 7 X8X9XioXiiX 12 Xi3, such that X 6 and X 7 are aromatic amino acids, X 8 , X 9 , 
Xn, and Xi 2 are any amino acid, and X i0 and X i3 are leucine; ii) the amino 
acid sequence further comprises two or more cysteines which are separated 
by at least 3 amino acids; and iii) with the proviso that the amino acid 
sequence is not insulin, insulin-like growth factor, an anti-insulin receptor 
antibody, an anti-insulin-like growth factor receptor antibody, or fragments 
thereof. 
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9. The method of claim 8, wherein the amino acid sequence decreases 
insulin-like growth factor receptor activity in mammalian cells selected from 
the group consisting of breast cancer cells, pancreatic cancer cells, and 
myeloid cells. 

10. The method of claim 9, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor antagonist sequence selected from the 
group consisting of: 



a. 


RP33-IGF (SEQ ID NO:2232); 


b. 


RP6KK (SEQ ID NO:2233); 


c. 


RP30-IGF (SEQ ID NO:2234); 


d. 


RP43 (SEQ ID NO:2235); 


e. 


RP33K-IGF (SEQ ID NO:2266); and 


f. 


RP6 (SEQ ID NO:2236). 



11. A method of increasing insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
increase the activity of insulin-like growth factor receptor, wherein i) the 
amino acid sequence comprises a Formula 6 sequence, X 6 2 X 6 3 X64 X 6 s X 6 e 

X67 X68 X69 X70 X71 X72 X73 X74 X75 X76 X77 X78 X79 X80 Xsi, SUCh that X62, X65, 

X66 X68, X69, X71, X73, X76, X77, X78, Xso and Xsi are any amino acid, X63 is 
selected from the group consisting of leucine, isoleucine, methionine and 
valine, X70 and X74 are selected from group consisting of valine, isoleucine, 
leucine, and methionine, X64 is selected from group consisting of aspartic 
acid and glutamic acid, X 6 7 is tryptophan, X75 is selected from group 
consisting of tyrosine and tryptophan, and X72 and X79 are cysteines; and ii) 
with the proviso that the amino acid sequence is not insulin, insulin-like 
growth factor, an anti-insulin receptor antibody, an anti-insulin-like growth 
factor receptor antibody, or fragments thereof. 
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12. The method of claim 11, wherein the amino acid sequence increases 
insulin-like growth factor receptor activity in mammalian cells selected from 
the group consisting of breast cancer cells, pancreatic cancer cells, and 
myeloid cells. 

13. The method of claim 12, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor agonist sequence selected from the 
group consisting of: 

a. D81 5 (SEQ ID NO:2252); and 

b. RP31-IGF (SEQ ID NO:2253). 

14. A method of modulating insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
modulate the activity of insulin-like growth factor receptor, wherein i) the 
amino acid sequence comprises at least two Formula 1 subsequences, 
XiX 2 X3X4X 5f such that X if X 2 , and X 5 are selected from the group consisting 
of phenylalanine and tyrosine, X 3 is selected from the group consisting of 
aspartic acid, glutamic acid, glycine and serine, and X4 is selected from 
group consisting of tryptophan, tyrosine and phenylalanine; and ii) with the 
proviso that the amino acid sequence is not insulin, insulin-like growth 
factor, an anti-insulin receptor antibody, an anti-insulin-like growth factor 
receptor antibody, or fragments thereof. 

15. The method of claim 14, wherein the amino acid sequence modulates 
insulin-like growth factor receptor activity in mammalian cells selected from 
the group consisting of breast cancer cells, pancreatic cancer cells, and 
myeloid cells. 

16. The method of claim 15, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor antagonist sequence selected from the 
group consisting of: 
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a. RP9-RP9 (C-C; SEQ ID NO:2237); 

b. RP9-RP9 (C-N; SEQ ID NO:2238); and 

c. RP9-L-RP9 (SEQ ID NO:2239). 

17. The method of claim 15, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor agonist sequence, RP9-12-RP9 (SEQ 
ID NO:2254). 

18. The method of claim 15, wherein the amino acid sequence further 
comprises two or more cysteines separated by at least 3 amino acids. 

19. The method of claim 18, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor antagonist sequence, G33-RP9 (SEQ 
ID NO:2240). 

20. The method of claim 18, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor agonist sequence selected from the 
group consisting of: 

a. D1 12-12-D1 12 (SEQ ID NO:2255); and 

b. G33-6-G33 (SEQ ID NO:2256). 

21. A method of decreasing insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
decrease the activity of insulin-like growth factor receptor, wherein i) the 
amino acid sequence comprises at least two Formula 2 subsequences, 
XgXyXsXgXioXuXiaX^, such that Xeand X 7 are aromatic amino acids, Xe, Xg, 
Xn, and X 12 are any amino acid, and X 10 and Xi 3 are leucine; ii) the amino 
acid sequence further comprises two or more cysteines separated by at 
least 3 amino acids; and iii) with the proviso that the amino acid sequence is 
not insulin, insulin-like growth factor, an anti-insulin receptor antibody, an 
anti-insulin-like growth factor receptor antibody, or fragments thereof. 
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22. The method of claim 21 , wherein the amino acid sequence decreases 
insulin-like growth factor receptor activity in mammalian cells selected from 
the group consisting of breast cancer cells, pancreatic cancer cells, and 
myeloid cells. 

23. The method of claim 22, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor antagonist sequence, RP30-IGF-12- 
RP30-IGF (SEQ ID NO:2241 ). 

24. A method of modulating insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
modulate the activity of insulin-like growth factor receptor, wherein i) the 
amino acid sequence comprises a Formula 1 subsequence X1X2X3X4X5, and 
a Formula 2 subsequence, X6X7X8X9X10XHX12X13, such that X1, X 2 , and X 5 
are selected from the group consisting of phenylalanine and tyrosine, X 3 is 
selected from the group consisting of aspartic acid, glutamic acid, glycine 
and serine, and X4 is selected from group consisting of tryptophan, tyrosine 
and phenylalanine, and such that X 6 and X 7 are aromatic amino acids, X 8 , 
X 9 , X 11t and X12 are any amino acid, and X10 and X13 are leucine; ii) the 
amino acid sequence further comprises two or more cysteines separated by 
at least 3 amino acids; iii) the subsequences are oriented Formula 1 to 
Formula 2; and iv) with the proviso that the amino acid sequence is not 
insulin, insulin-like growth factor, an anti-insulin receptor antibody, an anti- 
insulin-like growth factor receptor antibody, or fragments thereof. 

25. The method of claim 24, wherein the amino acid sequence modulates 
insulin-like growth factor receptor activity in mammalian cells selected from 
the group consisting of breast cancer cells, pancreatic cancer cells, and 
myeloid cells. 
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26. The method of claim 25, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor antagonist sequence, RP9-L-RP6 
(SEQ ID NO:2242). 

27. The method of claim 25, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor agonist sequence selected from the 
group consisting of: 

a. RP9-L-RP6 (SEQ ID NO:2242); and 

b. D1 12-12-RP30-IGF (SEQ ID NO:2257). 

28. A method of increasing insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
increase the activity of insulin-like growth factor receptor, wherein i) the 
amino acid sequence comprises a Formula 1 subsequence X1X2X3X4X5, and 
a Formula 2 subsequence, XeXrXsXgXioXuX^X^, such that Xi, X 2 , and X 5 
are selected from the group consisting of phenylalanine and tyrosine, X 3 is 
selected from the group consisting of aspartic acid, glutamic acid, glycine 
and serine, and X4 is selected from group consisting of tryptophan, tyrosine 
and phenylalanine, and such that X 6 and X 7 are aromatic amino acids, X 8 , 
X 9 , Xn, and X 12 are any amino acid, and X 10 and X 13 are leucine; ii) the 
amino acid sequence further comprises two or more cysteines separated by 
at least 3 amino acids; iii) the subsequences are oriented Formula 2 to 
Formula 1; and iv) with the proviso that the amino acid sequence is not 
insulin, insulin-like growth factor, an anti-insulin receptor antibody, an anti- 
insulin-like growth factor receptor antibody, or fragments thereof. 

29. The method of claim 28, wherein the amino acid sequence increases 
insulin-like growth factor receptor activity in mammalian cells selected from 
the group consisting of breast cancer cells, pancreatic cancer cells, and 
myeloid cells. 
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30. The method of claim 29, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor agonist sequence selected from the 
group consisting of: 

a. RP30-IGF-12-D112 (SEQ ID NO:2258); and 

b. RP6-RP9 (SEQ ID NO:2259). 

31. A method of decreasing insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
decrease the activity of insulin-like growth factor receptor, wherein i) the 
amino acid sequence comprises a Formula 1 subsequence X1X2X3X4X5, and 
a Formula 6 subsequence, X 6 2 X 63 X** X 6 5 X 68 X 67 X 68 X 6 g X70 X71 X72 X73 X74 
X75 X 7 6 X77 X 78 X79 X 80 X 8 i, such that X 1f X 2 , and X 5 are selected from the 
group consisting of phenylalanine and tyrosine, X 3 is selected from the 
group consisting of aspartic acid, glutamic acid, glycine and serine, and X4 is 
selected from group consisting of tryptophan, tyrosine and phenylalanine, 
and such that X62. X65, X66 X68, X69. X71, X73, X76, X77, X78, X 8 o and X 8 i are 
any amino acid, X 6 3 is selected from the group consisting of leucine, 
isoleucine, methionine and valine; X 7 o and X74 are selected from group 
consisting of valine, isoleucine, leucine and methionine; X64 is selected from 
group consisting of aspartic acid and glutamic acid; X 6 7 is tryptophan; X75 is 
selected from group consisting of tyrosine and tryptophan, and X 72 and X79 
are cysteines; ii) the subsequences are oriented Formula 1 to Formula 6; 
and iii) with the proviso that the amino acid sequence is not insulin, insulin- 
like growth factor, an anti-insulin receptor antibody, an anti-insulin-like 
growth factor receptor antibody, or fragments thereof. 

32. The method of claim 31 , wherein the amino acid sequence decreases 
insulin-like growth factor receptor activity in mammalian cells selected from 
the group consisting of breast cancer cells, pancreatic cancer cells, and 
myeloid cells. 
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33. The method of claim 32, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor antagonist sequence, G33-D8B12 
(SEQ ID NO:2243). 

34. A method of modulating insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
modulate the activity of insulin-like growth factor receptor, wherein i) the 
amino acid sequence comprises a Formula 1 subsequence X1X2X3X4X5, and 
a Formula 6 subsequence, X 6 2 Xq 3 Xs4 X 65 X 66 X 67 X 68 X 69 X 70 X71 X72 X73 X 74 
X 75 X 76 X 77 X 78 X 79 Xso Xsi, such that X1, X 2 , and X 5 are selected from the 
group consisting of phenylalanine and tyrosine, X 3 is selected from the 
group consisting of aspartic acid, glutamic acid, glycine and serine, and X4 is 
selected from group consisting of tryptophan, tyrosine and phenylalanine, 
and such that X62, X65, X66 X 88 , X69, X 7 i, X 7 3, X 7 6, X 77 , X 78 , Xso and X 8 i are 
any amino acid, X 6 3 is selected from the group consisting of leucine, 
isoleucine, methionine and valine; X 70 and X 74 are selected from group 
consisting of valine, isoleucine, leucine and methionine; X64 is selected from 
group consisting of aspartic acid and glutamic acid; X 67 is tryptophan; X 75 is 
selected from group consisting of tyrosine and tryptophan, and X 72 and X 79 
are cysteines; ii) the subsequences are oriented Formula 6 to Formula 1; 
and iii) with the proviso that the amino acid sequence is not insulin, insulin- 
like growth factor, an anti-insulin receptor antibody, an anti-insulin-like 
growth factor receptor antibody, or fragments thereof. 

35. The method of claim 34, wherein the amino acid sequence modulates 
insulin-like growth factor receptor activity in mammalian cells selected from 
the group consisting of breast cancer cells, pancreatic cancer cells, and 
myeloid cells. 
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36. The method of claim 35, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor antagonist sequence, D8B12-12-RP9 
(SEQ ID NO:2244). 

37. The method of claim 35, wherein the amino acid sequence comprises 
an insulin-like growth factor receptor agonist sequence selected from the 
group consisting of: 

a. D8B12-12-RP9 (SEQ ID NO:2244); and 

b. D81 5-RP9 (SEQ ID NO:2260). 

38. A method of decreasing insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
decrease the activity of insulin-like growth factor receptor, wherein the 
amino acid sequence comprises an insulin-like growth factor receptor 
antagonist sequence selected from the group consisting of: 



a. 


H2C-A-H6 (SEQ ID NO:2228); 


b. 


RP9 (SEQ ID NO:2229); 


c. 


C1 (SEQ ID NO:2230); 


d. 


L-RP9ex (SEQ ID NO:2231); 


e. 


RP33-IGF (SEQ ID NO:2232); 


f. 


RP6KK (SEQ ID NO:2233); 


g- 


RP30-IGF (SEQ ID NO:2234); 


h. 


RP43 (SEQ ID NO:2235); 


i. 


RP6 (SEQ ID NO:2236); 


j- 


RP52 (SEQ ID NO:2245); 


k. 


RP54 (SEQ ID NO:2246); 
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I. RP33K-IGF (SEQ ID NO:2266); 

m. C1 KK (SEQ ID NO:2266); and 

n. RP56 (SEQ ID NO:2247). 

39. A method of decreasing insulin-like growth factor receptor activity in 
activity in insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
decrease the activity of insulin-like growth factor receptor, wherein the 
amino acid sequence comprises an insulin-like growth factor receptor 
antagonist sequence selected from the group consisting of: 



a. 


RP9-RP9 (C-C; SEQ ID NO:2237); 


b. 


RP9-RP9 (C-N; SEQ ID NO:2238); 


c. 


RP9-L-RP9 (SEQ ID NO:2239); 


d. 


G33-RP9 (SEQ ID NO:2240); 


e. 


RP30-IGF-12-RP30-IGF (SEQ ID NO:2241); 


f. 


RP9-L-RP6 (SEQ ID NO:2242); 


g- 


G33-D8B12 (SEQ ID NO:2243); and 


h. 


D8B12-12-RP9 (SEQ ID NO:2244). 



40. A method of increasing insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
increase the activity of insulin-like growth factor receptor, wherein the amino 
acid sequence comprises an insulin-like growth factor receptor agonist 
sequence selected from the group consisting of: 

a. S175(SEQ ID NO:2248); 

b. G33 (SEQ ID NO:2249); 

c. 12-RP9ex (SEQ ID NO:2250); 
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d. L-RP9ex (SEQ ID NO:2231 ); 

e. D815(SEQ ID NO:2252); 

f. RP31-IGF (SEQ ID NO:2253); 

g. RP48 (SEQ ID NO:2261 ); and 

h. RP60 (SEQ ID NO:2262). 

41. A method of increasing insulin-like growth factor receptor activity in 
insulin-like growth factor-responsive mammalian cells comprising: 
contacting the cells with an amino acid sequence in an amount sufficient to 
increase the activity of insulin-like growth factor receptor, wherein the amino 
acid sequence comprises an insulin-like growth factor receptor agonist 
sequence selected from the group consisting of: 



a. 


RP9-12-RP9 (SEQ ID NO:2254); 


b. 


D112-12-D112(SEQ ID NO:2255); 


c. 


G33-6-G33 (SEQ ID NO:2256); 


d. 


RP9-L-RP6 (SEQ ID NO:2242); 


e. 


D112-12-RP30-IGF (SEQ ID NO:2257); 


f. 


RP9-L-RP6 (SEQ ID NO:2242); 


g- 


RP30-IGF-12-D112 (SEQ ID NO:2258); 


h. 


RP6-RP9 (SEQ ID NO:2259); 


i. 


D8B12-12-RP9 (SEQ ID NO:2244); 


j- 


RP6-L-D8B12 (SEQ ID NO:2263); 


k. 


RP30-IGF-12-RP31-IGF (SEQ ID NO:2264); 


I. 


RP31-IGF-12-RP30-IGF (SEQ ID NO:2265); and 


m. 


D815-RP9 (SEQ ID NO:2260). 
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42. An insulin-like growth factor receptor modulator comprising an amino 
acid sequence which comprises a Formula 1 sequence, X1X2X3X4X5, such 
that X1, X 2 , and X 5 are selected from the group consisting of phenylalanine 
and tyrosine, X 3 is selected from the group consisting of aspartic acid, 
glutamic acid, glycine and serine, and X4 is selected from group consisting 
of tryptophan, tyrosine and phenylalanine, wherein the amino acid sequence 
further comprises two or more cysteines separated by at least 3 amino 
acids; and with the proviso that the amino acid sequence is not insulin, 
insulin-like growth factor, an anti-insulin receptor antibody, an anti-insulin- 
like growth factor receptor antibody, or fragments thereof. 

43. The insulin-like growth factor receptor modulator of claim 42, wherein 
amino acid sequence modulates insulin-like growth factor receptor activity in 
mammalian cells selected from the group consisting of breast cancer cells, 
pancreatic cancer cells, and myeloid cells. 

44. The insulin-like growth factor receptor modulator of claim 43, wherein 
the amino acid sequence comprises an insulin-like growth factor receptor 
antagonist sequence selected from the group consisting of: 

a. C1 (SEQ ID NO:2230); 

b. C1 KK (SEQ ID NO:2266); and 

c. L-RP9ex (SEQ ID NO:2231 ). 

45. The insulin-like growth factor receptor modulator of claim 43, wherein 
the amino acid sequence comprises an insulin-like growth factor receptor 
agonist sequence selected from the group consisting of: 

a. G33 (SEQ ID NO:2249); and 

b. L-RP9ex (SEQ ID NO:2231 ). 
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46. An insulin-like growth factor receptor antagonist comprising an amino 
acid sequence which comprises a Formula 2 sequence, 
XgX 7 XqX 9 XioX^Xi 2 X<i3, such that X6 and X 7 are aromatic amino acids, Xs, X 9 , 
X 11t and X 12 are any amino acid, and X 10 and X i3 are leucine, wherein the 
amino acid sequence further comprises two or more cysteines separated by 
at least 3 amino acids, with the proviso that the amino acid sequence is not 
insulin, insulin-like growth factor, an anti-insulin receptor antibody, an anti- 
insulin-like growth factor receptor antibody, or fragments thereof. 

47. The insulin-like growth factor receptor antagonist of claim 46, wherein 
amino acid sequence decreases insulin-like growth factor receptor activity in 
mammalian cells selected from the group consisting of breast cancer cells, 
pancreatic cancer cells, and myeloid cells. 

48. The insulin-like growth factor receptor antagonist of claim 47, wherein 
the amino acid sequence comprises a sequence selected from the group 
consisting of: 

a. RP33-IGF (SEQ ID NO:2232); 

b. RP6KK (SEQ ID NO:2233); 

c. RP30-IGF (SEQ ID NO:2234); 

d. RP43 (SEQ ID NO:2235); 

e. RP33K-IGF (SEQ ID NO:2266); and 

f. RP6 (SEQ ID NO:2236). 
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49. An insulin-like growth factor receptor modulator comprising an amino 
acid sequence which comprises at least two Formula 1 subsequences, 
X1X2X3X4X5, such that X1, X 2 , and X 5 are selected from the group consisting 
of phenylalanine and tyrosine, X 3 is selected from the group consisting of 
aspartic acid, glutamic acid, glycine and serine, and X4 is selected from 
group consisting of tryptophan, tyrosine and phenylalanine, wherein the 
amino acid sequence further comprises two or more cysteines separated by 
at least 3 amino acids, with the proviso that the amino acid sequence is not 
insulin, insulin-like growth factor, an anti-insulin receptor antibody, an anti- 
insulin-like growth factor receptor antibody, or fragments thereof. 

50. The insulin-like growth factor receptor modulator of claim 49, wherein 
amino acid sequence modulates insulin-like growth factor receptor activity in 
mammalian cells selected from the group consisting of breast cancer cells, 
pancreatic cancer cells, and myeloid cells. 

51 . The insulin-like growth factor receptor modulator of claim 50, wherein 
the amino acid sequence comprises an insulin-like growth factor receptor 
antagonist sequence, G33-RP9 (SEQ ID NO:2240). 

52. The insulin-like growth factor receptor modulator of claim 50, wherein 
the amino acid sequence comprises an insulin-like growth factor receptor 
agonist sequence selected from the group consisting of: 

a. D112-12-D112 (SEQ ID NO:2255); and 

b. G33-6-G33 (SEQ ID NO:2256). 
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53. An insulin-like growth factor receptor antagonist comprising an amino 
acid sequence that comprises at least two Formula 2 subsequences, 
XeX/XsXgXioXuX^Xis, such that Xe and X 7 are aromatic amino acids, Xa, X 9 , 
Xn, and Xi 2 are any amino acid, and X 10 and X13 are leucine, wherein the 
amino acid sequence further comprises two or more cysteines separated by 
at least 3 amino acids, with the proviso that the amino acid sequence is not 
insulin, insulin-like growth factor, an anti-insulin receptor antibody, an anti- 
insulin-like growth factor receptor antibody, or fragments thereof. 

54. The insulin-like growth factor receptor antagonist of claim 53, wherein 
amino acid sequence decreases insulin-like growth factor receptor activity in 
mammalian cells selected from the group consisting of breast cancer cells, 
pancreatic cancer cells, and myeloid cells. 

55. The insulin-like growth factor receptor antagonist of claim 54, wherein 
amino acid sequence comprises the sequence RP30-IGF-12-RP30-IGF 
(SEQ ID NO:2241). 

56. An insulin-like growth factor receptor modulator comprising an amino 
acid sequence which comprises a Formula 1 subsequence X1X2X3X4X5, and 
a Formula 2 subsequence, XeX/XsXgXioXuX^Xia, such that X1, X 2 , and X 5 
are selected from the group consisting of phenylalanine and tyrosine, X 3 is 
selected from the group consisting of aspartic acid, glutamic acid, glycine 
and serine, and X4 is selected from group consisting of tryptophan, tyrosine 
and phenylalanine, and such that X 6 and X 7 are aromatic amino acids, X 8 , 
X 9 , Xii, and X12 are any amino acid, and X10 and X13 are leucine, wherein 
the amino acid sequence further comprises two or more cysteines separated 
by at least 3 amino acids, and wherein the subsequences are oriented 
Formula 1 to Formula 2, with the proviso that the amino acid sequence is 
not insulin, insulin-like growth factor, an anti-insulin receptor antibody, an 
anti-insulin-like growth factor receptor antibody, or fragments thereof. 
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57. The insulin-like growth factor receptor modulator of claim 56, wherein 
amino acid sequence modulates insulin-like growth factor receptor activity in 
mammalian cells selected from the group consisting of breast cancer cells, 
pancreatic cancer cells, and myeloid cells. 

58. The insulin-like growth factor receptor modulator of claim 57, wherein 
the amino acid sequence comprises an insulin-like growth factor receptor 
antagonist sequence, RP9-L-RP6 (SEQ ID NO:2242). 

59. The insulin-like growth factor receptor modulator of claim 57, wherein 
the amino acid sequence comprises an insulin-like growth factor receptor 
agonist sequence selected from the group consisting of: 

a. RP9-L-RP6 (SEQ ID NO:2242); and 

b. D1 12-12-RP30-IGF (SEQ ID NO:2257). 

60. An insulin-like growth factor receptor agonist comprising an amino 
acid sequence which comprises a Formula 1 subsequence X 1 X 2 X3X 4 X5 I and 
a Formula 2 subsequence, XeXjWsXioX^X^z, such that X 1f X 2 , and X 5 
are selected from the group consisting of phenylalanine and tyrosine, X 3 is 
selected from the group consisting of aspartic acid, glutamic acid, glycine 
and serine, and X4 is selected from group consisting of tryptophan, tyrosine 
and phenylalanine, and such that X 6 and X 7 are aromatic amino acids, X 8 , 
X 9 , Xn t and X 12 are any amino acid, and X10 and X13 are leucine, wherein 
the amino acid sequence further comprises two or more cysteines separated 
by at least 3 amino acids, and wherein the subsequences are oriented 
Formula 2 to Formula 1, with the proviso that the amino acid sequence is 
not insulin, insulin-like growth factor, an anti-insulin receptor antibody, an 
anti-insulin-like growth factor receptor antibody, or fragments thereof. 
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61. The insulin-like growth factor receptor agonist of claim 60, wherein 
amino acid sequence modulates insulin-like growth factor receptor activity in 
mammalian cells selected from the group consisting of breast cancer cells, 
pancreatic cancer cells, and myeloid cells. 

62. The insulin-like growth factor receptor agonist of claim 61, wherein 
the amino acid sequence comprises a sequence selected from the group 
consisting of: 

a. RP30-IGF-12-D112 (SEQ ID NO:2258); and 

b. RP6-RP9 (SEQ ID NO:2259). 

63. An insulin-like growth factor receptor antagonist comprising an amino 
acid sequence selected from the group consisting of: 



a. 


H2C-A-H6 (SEQ ID NO:2228); 


b. 


L-RP9ex (SEQ ID NO:2231); 


c. 


RP33-IGF (SEQ ID NO:2232); 


d. 


RP30-IGF (SEQ ID NO:2234); 


e. 


RP43 (SEQ ID NO:2235); 


f. 


G33-RP9 (SEQ ID NO:2240); 


g- 


RP30-IGF-12-RP30-IGF (SEQ ID NO:2241); 


h. 


RP9-L-RP6 (SEQ ID NO:2242); 


i. 


G33-D8B12 (SEQ ID NO:2243); 


j 


D8B12-12-RP9 (SEQ ID NO:2244); 


k. 


RP52 (SEQ ID NO:2245); 


I. 


RP54 (SEQ ID NO:2246); 


m. 


RP33K-IGF (SEQ ID NO:2266); and 


n. 


RP56 (SEQ ID NO:2247). 
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64. An insulin-like growth factor receptor agonist comprising an amino 
acid sequence selected from the group consisting of: 



a. 


12-RP9ex (SEQ ID NO:2250); 


b. 


L-RP9ex (SEQ ID NO:2231); 


c. 


RP31-IGF (SEQ ID NO:2253); 


d. 


D112-12-D112 (SEQ ID NO:2255); 


e. 


G33-6-G33 (SEQ ID NO:2256); 


f. 


RP9-L-RP6 (SEQ ID NO:2242); 


9- 


D112-12-RP30-IGF (SEQ ID NO:2257); 


h. 


RP30-IGF-12-D112 (SEQ ID NO:2258); 


i. 


RP6-RP9 (SEQ ID NO:2259); 


j- 


D8B12-12-RP9 (SEQ ID NO:2244); 


k. 


D815-RP9 (SEQ ID NO:2260); 


I. 


RP48 (SEQ ID NO:2261); and 


m. 


RP60 (SEQ ID NO:2262). 



65. A method of identifying an insulin-like growth factor receptor 
modulator comprising: 
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a. contacting insulin-like growth factor receptor with an amino 
acid sequence to form a complex, wherein i) the amino acid sequence 
comprises a Formula 1 sequence, X 1 X 2 X 3 X4X5, such that X if X 2 , and X 5 are 
selected from the group consisting of phenylalanine and tyrosine, X 3 is 
selected from the group consisting of aspartic acid, glutamic acid, glycine 
and serine, and X4 is selected from group consisting of tryptophan, tyrosine 
and phenylalanine; ii) the amino acid sequence further comprises two or 
more cysteines separated by at least 3 amino acids; and iii) with the proviso 
that the amino acid sequence is not insulin, insulin-like growth factor, an 
anti-insulin receptor antibody, an anti-insulin-like growth factor receptor 
antibody, or fragments thereof; 

b. contacting the complex of (a) with a compound library; 

c. identifying a compound which disrupts the complex of (a); and 

d. determining whether the compound exhibits agonist or 
antagonist activity at insulin-like growth factor receptor, wherein this activity 
indicates identification of an insulin-like growth factor receptor modulator. 

66. The method of claim 65, wherein the amino acid sequence comprises 
a sequence selected from the group consisting of: 

a. C1 (SEQ ID NO:2230); 

b. L-RP9ex (SEQ ID NO:2231 ); 

c. G33 (SEQ ID NO:2249); 

d. C1 KK (SEQ ID NO:2266); and 

e. L-RP9ex (SEQ ID NO:2231 ). 

67. A method of identifying an insulin-like growth factor receptor 
modulator comprising: 
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a. contacting insulin-like growth factor receptor with an amino 
acid sequence to form a complex, wherein i) the amino acid sequence 
comprises a Formula 2 sequence, XeXyXeXgXioXuX^X^, such that Xe and 
X 7 are aromatic amino acids, Xa, X 9 , X 11t and X i2 are any amino acid, and 
X i0 and X i3 are leucine; ii) the amino acid sequence further comprises two 
or more cysteines separated by at least 3 amino acids; and iii) with the 
proviso that the amino acid sequence is not insulin, insulin-like growth 
factor, an anti-insulin receptor antibody, an anti-insulin-like growth factor 
receptor antibody, or fragments thereof; 

b. contacting the complex of (a) with a compound library; 

c. identifying a compound which disrupts the complex of (a); and 

d. determining whether the compound exhibits agonist or 
antagonist activity at insulin-like growth factor receptor, wherein this activity 
indicates identification of an insulin-like growth factor receptor modulator. 

68. The method of claim 67, wherein the amino acid sequence comprises 
a sequence selected from the group consisting of: 



a. 


RP33-IGF (SEQ ID NO:2232); 


b. 


RP6KK (SEQ ID NO:2233); 


c. 


RP30-IGF (SEQ ID NO:2234); 


d. 


RP43 (SEQ ID NO:2235); 


e. 


RP33K-IGF (SEQ ID NO:2266); and 


f. 


RP6 (SEQ ID NO:2236). 



69. A method of identifying an insulin-like growth factor receptor 
modulator comprising: 
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a. contacting insulin-like growth factor receptor with an amino 
acid sequence to form a complex, wherein i) the amino acid sequence 
comprises a Formula 6 sequence, Xq 2 X 63 Xe4 Xes X 66 Xe 7 Xe 8 X 69 X 70 X71 X 72 
X 73 X 74 X 75 X 76 X 77 X 78 X 79 Xbo Xai, such that X 6 2, X 6 s, X66 X68, X69, X 71 , X 73 , 
X 76 , X 77 , X 7 a, Xao and Xei are any amino acid, X63 is selected from the group 
consisting of leucine, isoleucine, methionine and valine; X 70 and X 74 are 
selected from group consisting of valine, isoleucine, leucine and methionine; 
X64 is selected from group consisting of aspartic acid and glutamic acid; X 67 
is tryptophan; X 75 is selected from group consisting of tyrosine and 
tryptophan, and X 72 and X 79 are cysteines; and ii) with the proviso that the 
amino acid sequence is not insulin, insulin-like growth factor, an anti-insulin 
receptor antibody, an anti-insulin-like growth factor receptor antibody, or 
fragments thereof; 

b. contacting the complex of (a) with a compound library; 

c. identifying a compound which disrupts the complex of (a); and 

d. determining whether the compound exhibits agonist or 
antagonist activity at insulin-like growth factor receptor, wherein this activity 
indicates identification of an insulin-like growth factor receptor modulator. 

70. The method of claim 69, wherein the amino acid sequence comprises 
a sequence selected from the group consisting of: 

a. D81 5 (SEQ ID NO:2252); and 

b. RP31-IGF (SEQ ID NO:2253). 

71. A method of identifying an insulin-like growth factor receptor 
modulator comprising: 

a. contacting insulin-like growth factor receptor with an amino 
acid sequence to form a complex, 

b. contacting the complex of (a) with a compound library; 
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c. identifying a compound which disrupts the complex of (a); and 

d. determining whether the compound exhibits agonist or 
antagonist activity at insulin-like growth factor receptor, wherein this activity 
indicates identification of an insulin-like growth factor receptor modulator, 
and wherein the amino acid sequence comprises a sequence selected from 
the group consisting of: 



a. 


H2C-A-H6 (SEQ ID NO:2228); 


b. 


RP9 (SEQ ID NO:2229); 


c. 


C1 (SEQ ID NO:2230); 


d. 


L-RP9ex (SEQ ID NO:2231); 


e. 


RP33-IGF (SEQ ID NO:2232); 


f. 


RP6KK (SEQ ID NO:2233); 


g. 


RP30-IGF (SEQ ID NO:2234); 


h. 


RP43 (SEQ ID NO:2235); 


i. 


Kro (obU IU NU.^oD), 


j- 


RP9-RP9 (C-C; SEQ ID NO:2237); 


k. 


RP9-RP9 (C-N; SEQ ID NO:2238); 


1. 


RP9-L-RP9 (SEQ ID NO:2239); 


m. 


G33-RP9 (SEQ ID NO:2240); 


n. 


RP30-IGF-12-RP30-IGF (SEQ ID NO:2241); 


o. 


RP9-L-RP6 (SEQ ID NO:2242); 


P- 


G33-D8B12 (SEQ ID NO:2243); 


q- 


D8B12-12-RP9 (SEQ ID NO:2244); 


r. 


RP52 (SEQ ID NO:2245); 
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s. RP54 (SEQ ID NO:2246); 

t. RP33K-IGF (SEQ ID NO:2266); 

u. C1 KK (SEQ ID NO:2266); and 

v. RP56 (SEQ ID NO:2247). 

72. A method of identifying an insulin-like growth factor receptor 
modulator comprising: 

a. contacting insulin-like growth factor receptor with an amino 
acid sequence to form a complex, 

b. contacting the complex of (a) with a compound library; 

c. identifying a compound which disrupts the complex of (a); and 

d. determining whether the compound exhibits agonist or 
antagonist activity at insulin-like growth factor receptor, wherein this activity 
indicates identification of an insulin-like growth factor receptor modulator, 
and wherein the amino acid sequence comprises a sequence selected from 
the group consisting of: 



a. 


S175 (SEQ ID NO:2248); 


b. 


G33 (SEQ ID NO:2249); 


c. 


12-RP9ex (SEQ ID NO:2250); 


d. 


L-RP9ex (SEQ ID NO:2231); 


e. 


D815(SEQ ID NO:2252); 


f. 


RP31-IGF (SEQ ID NO:2253); 


g- 


RP9-12-RP9 (SEQ ID NO:2254); 


h. 


D112-12-D112 (SEQ ID NO:2255); 


i. 


G33-6-G33 (SEQ ID NO:2256); 


j- 


RP9-L-RP6 (SEQ ID NO:2242); 
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k. 


D112-12-RP30-IGF (SEQ ID NO:2257); 


1. 


RP9-L-RP6 (SEQ ID NO:2242); 


m. 


RP30-IGF-12-D112 (SEQ ID NO:2258); 


n. 


RP6-RP9 (SEQ ID NO:2259); 


o. 


D8B12-12-RP9 (SEQ ID NO:2244); 


P- 


D815-RP9 (SEQ ID NO:2260); 


r 
1 . 




s. 


RP6-L-D8B12 (SEQ ID NO:2263); 


t. 


RP30-IGF-12-RP31-IGF (SEQ ID NO:2264); 


u. 


RP31-IGF-12-RP30-IGF (SEQ ID NO:2265); and 


V. 


RP60 (SEQ ID NO:2262). 



73. A method of identifying an insulin-like growth factor receptor 
modulator comprising: 

a. screening a library of amino acid sequences to isolate an 
amino acid sequence that binds to an insulin-like growth factor receptor, 
wherein the library is derived from a peptide sequence comprising at least 
one formula sequence selected from the group consisting of Formula 1, 
Formula 2, and Formula 6; and 

b. determining whether the amino acid sequence isolated in (a) 
exhibits agonist or antagonist activity at insulin-like growth factor receptor in 
an insulin-like growth factor-responsive cell selected from the group 
consisting of FDC-P2, MCF-7, and MiaPaCa, wherein this activity indicates 
identification of an insulin-like growth factor receptor modulator. 
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74. A method of enhancing survival of an insulin-like growth factor- 
responsive mammalian cell comprising: contacting the cell with an amino 
acid sequence in an amount sufficient to enhance the survival of the cell, 
wherein the amino acid sequence is an insulin-like growth factor receptor 
agonist comprising at least one formula sequence selected from the group 
consisting of Formula 1, Formula 2, and Formula 6. 
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Peptide or Dimer (log M) 
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